GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:08:51

Sequence gi568815597f:167622152_167887918 : 265767 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8092   8278  187  0  1  136   88    82 0.025  12.26
 1.02 Intr +  33287  33412  126  0  0  108   67    29 0.011   3.45
 1.03 Intr +  51707  51783   77  0  2   94   36    37 0.023  -1.67
 1.04 Intr +  61749  61850  102  2  0   87  105   164 0.758  18.37
 1.05 Intr +  63270  63359   90  2  0  111  107    24 0.925   6.79
 1.06 Intr +  67898  67969   72  1  0   63  111    49 0.944   4.30
 1.07 Intr +  71948  72151  204  1  0   87   94   124 0.931  12.30
 1.08 Intr +  74948  75691  744  1  0   84   92   358 0.824  27.45
 1.09 Intr +  90363  90468  106  2  1  108  101   -12 0.092   1.89
 1.10 Intr +  92262  92437  176  1  2   93   66    80 0.017   5.96
 1.11 Intr +  99962 100091  130  1  1  -12   84   141 0.000   3.97
 1.12 Intr + 143432 143598  167  0  2  105  103   112 0.969  14.08
 1.13 Intr + 150124 150337  214  0  1   71   75   122 0.988   7.49
 1.14 Intr + 151085 151217  133  0  1  106   86    54 0.878   6.70
 1.15 Intr + 152062 152101   40  1  1  100   83   -10 0.229  -2.07
 1.16 Term + 165669 165770  102  2  0  125   48   108 0.986   8.78
 1.17 PlyA + 167362 167367    6                               1.05

 2.25 PlyA - 170122 170117    6                               1.05
 2.24 Term - 184016 183897  120  2  0   60   43   100 0.457   1.17
 2.23 Intr - 188461 188375   87  1  0  -11   81   105 0.396   0.07
 2.22 Intr - 188762 188574  189  2  0   89   53    78 0.697   4.18
 2.21 Intr - 196116 195921  196  2  1   82  100    45 0.015   4.52
 2.20 Intr - 199990 199873  118  2  1  137   52     9 0.131   1.72
 2.19 Intr - 200972 200857  116  1  2  107  119    65 0.996  11.59
 2.18 Intr - 202421 202325   97  0  1   86   68     5 0.981  -2.63
 2.17 Intr - 202704 202500  205  0  1  122   82   247 0.966  26.27
 2.16 Intr - 207272 207116  157  1  1   73   91   -16 0.536  -2.79
 2.15 Intr - 211011 210836  176  0  2  113   86    96 0.601  10.64
 2.14 Intr - 211926 211819  108  0  0   74   94    59 0.625   5.58
 2.13 Intr - 214389 214158  232  2  1   28  103   166 0.780   9.88
 2.12 Intr - 215167 215098   70  2  1   96   80    38 0.777   2.04
 2.11 Intr - 223702 223412  291  2  0  108   25   329 0.868  25.71
 2.10 Intr - 224112 223834  279  1  0   78   92   145 0.972  11.35
 2.09 Intr - 226338 226210  129  1  0  104  116    36 0.992   8.67
 2.08 Intr - 232338 232202  137  2  2   95   89     1 0.973   1.11
 2.07 Intr - 234288 234014  275  0  2   75   99   260 0.988  22.14
 2.06 Intr - 237742 237656   87  1  0   57   99    34 0.714   1.57
 2.05 Intr - 238912 238720  193  0  1   83   93   165 0.792  15.99
 2.04 Intr - 256484 256295  190  0  1   81   65   141 0.379   9.74
 2.03 Intr - 258040 257964   77  0  2   53   69    68 0.860   0.56
 2.02 Intr - 258458 258340  119  2  2  110   81    39 0.983   4.66
 2.01 Intr - 261477 261286  192  0  0   58  115   196 0.720  18.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  82599  82676   78  2  0   91   49   101 0.919   4.26
S.002 Sngl - 100156  99941  216  1  0  104   51   203 0.879  11.38

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:167622152_167887918|GENSCAN_predicted_peptide_1|889_aa
GPRGRGSPAAERSRARATAHSPCARRSPKLATAGSGGDSGDRELRPGRAGASAAESAPEG HGGVSYGVAVRKDRARHIISFVPEIIKKSPCYDLRVPSQTPSGQASCYLSFKTLRKHHFL CEAFPDSPKQERPAETNANVDNSASPSVAQLAGRFREQAAAAKETPASKPTRRKPPCSLP LFPPKVDLGQNGEEKSPPNASHPPKFKVKSSPLIEKLQANLTFDPAALLPGASPKSPGLK AMVSPFHSPPSTPSSPGVRSRPSEAEEVPVSFDQPPEGSHLPCYNKVRTRGSIKRRPPSR RFRRSQSDCGELGDFRAVESSQQNGAKEEDGDEVLPSKSKAPGSPLSSEGAAGEGVRTLG PAEKPPLRRSPSRTEKQEEDRATEEAKNGEKARRSSEEVDGQHPAQEEVPESPQTSGPEA ENRCGSPREEKPAGEEAEMEKATEVKGERVQNEEVGPEHDSQETKKLEEGAAVKETPHSP PGGVKGGDVPKQEKGKEKQQEGAVLEPGCSPQTGPAQLETSSEVQSEPAVPKPEAWEIKT TTNKNKQKTLESVYSAKIEVSTSIMFHTHARAPGRPPTLRVEQPGRARKPEGGFRERVRT WRPRRRWGVGTAGPGKSRSLPAPPRCGAGRGGGGGDCSGWTMAASAGAGAVIAAPDSRRW LWSVLAAALGLLTAGVSALEVYTPKEIFVANGTQGKLTCKFKSTSTTGGLTSVSWSFQPE GADTTVSFFHYSQGQVYLGNYPPFKDRISWAGDLDKKDASINIENMQFIHNGTYICDVKN PPDIVVQPGHIRLYVVEKENLPVFPVWVVVGIVTAVVLGLTLLISMILAVLYRRKNSKRD YTGWKANPYKLCRTHVGPVIYAQLDHSGGHHSDKINKSESVVYADIRKN

>gi568815597f:167622152_167887918|GENSCAN_predicted_CDS_1|2670_bp
ggcccccgcggccggggcagtcccgcagccgagcgcagccgggcgcgcgccaccgcccac tcgccctgtgcccgccgcagcccgaaactggccacggccgggagcggaggggacagcggg gatcgtgagctccggcccgggcgagcgggtgcgtctgccgcagagtcggcacctgaagga catggaggtgtttcctatggtgttgctgtcagaaaggacagagctcgtcacataatcagc tttgttcctgagattatcaaaaagtctccctgctatgacctgcgtgtgccatcccaaacc ccctctgggcaagcaagctgttatctgtccttcaagactctgcgtaaacatcacttcctc tgtgaagcgttccctgattcacctaagcaggaaagaccggcagagaccaatgccaatgtg gacaactcggcgtccccctcggtggcccagctggccgggcggtttagggagcaggcggct gcagccaaggagacaccagccagtaaaccaacccgaaggaaaccgccctgttccctcccc ctgttcccccccaaggtagacctgggccagaatggtgaggagaaatcaccacccaatgcg agccaccctcctaaattcaaggtcaagagctcgcctctgattgagaagcttcaggccaat ttaacctttgacccagctgctctactgcctggggcctcacccaagagtcctggactcaag gctatggtgtcgccatttcacagcccaccttctacccccagcagccctggtgtgcgatct aggcccagcgaggcagaggaggtgcctgtcagcttcgaccagccccctgaaggcagtcat ctgccctgttacaacaaggtgcggacgaggggctcaataaaaaggcgccctccctccagg cgattccgaaggtcacagtcagactgtggagaacttggagatttcagggcggtggagtca tctcagcagaacggtgctaaggaagaggatggggatgaagtgttgccatccaagagcaag gccccaggatcccctttgtccagtgagggagcagcgggagagggagtgagaaccctggga cctgctgaaaagcctcctctgaggaggtcacccagcaggacagagaagcaggaggaggac agggccacagaggaagccaagaacggtgaaaaggccaggcggagttcagaggaggtggac ggccagcacccggcccaagaggaggtcccggaatcgccccagacctctggcccagaggca gaaaataggtgtgggagccccagggaggaaaagccagctggagaggaagcagagatggaa aaggctacagaggtgaagggggagagggtgcaaaatgaagaggtgggacctgaacatgac agccaagaaacaaagaagctggaggagggagctgcagtgaaggagaccccccacagtccc cctggaggagtgaagggcggagatgtccccaagcaggaaaaaggcaaggaaaaacaacag gagggggcagtgctcgagccaggctgcagcccccagaccggccctgcccagctggagacc agcagtgaggtccagagcgagccagcagtccccaagccggaggcttgggaaattaaaaca acaacgaacaaaaacaaacaaaaaacccttgagagtgtatactcagcaaagatagaagtg tccacctctatcatgttccacacacatgcccgggctcccgggcgcccgccaacactgcga gtggaacagccggggcgagcccgtaagccggaaggcggcttccgggagcgcgtgcgaacg tggaggccacgccggcgatggggggttggaacggccggaccaggcaaaagccggtccctc cctgctcctccacgctgtggggcaggacgcggcggcggcggcggcgactgcagtggctgg 
acgatggcagcgtccgccggagccggggcggtgattgcagccccagacagccggcgctgg ctgtggtcggtgctggcggcggcgcttgggctcttgacagctggagtatcagccttggaa gtatatacgccaaaagaaatcttcgtggcaaatggtacacaagggaagctgacctgcaag ttcaagtctactagtacgactggcgggttgacctcagtctcctggagcttccagccagag ggggccgacactactgtgtcgtttttccactactcccaagggcaagtgtaccttgggaat tatccaccatttaaagacagaatcagctgggctggagaccttgacaagaaagatgcatca atcaacatagaaaatatgcagtttatacacaatggcacctatatctgtgatgtcaaaaac cctcctgacatcgttgtccagcctggacacattaggctctatgtcgtagaaaaagagaat ttgcctgtgtttccagtttgggtagtggtgggcatagttactgctgtggtcctaggtctc actctgctcatcagcatgattctggctgtcctctatagaaggaaaaactctaaacgggat tacactggctggaaagctaacccatataagctatgtagaactcatgtgggcccagtcata tatgcacagttagaccactccggcggacatcacagtgacaagattaacaagtcagagtct gtggtgtatgcggatatccgaaagaattaa

>gi568815597f:167622152_167887918|GENSCAN_predicted_peptide_2|1279_aa
IDNKQLQGYLSELRPVTIVFVNLMFEDQDKAEEIGPAIQDAYMHITSVLKIFQGQINKVF MFDKGCSFLCVFGFPGEKVPDELTHALECAMDIFDFCSQVHKIQTVSIGVASGIVFCGIV GHTVRHEYTVIGQKVNLAARMMMYYPGIVTCDSVTYNGSNLPAYFFKELPKKVMKGVADS GPLYQYWGRTEKVIIAISLNKISFHQTFYTIQMFMANVLGLDTCKHYKERQTNLRNKVMT LLDEKFYCLLNDIFHVQFPISREISRMSTLKKQKQLEILFMKILKLIVKEERIIFIIDEA QFVDSTSWRFMEKLIRTLPIFIIMSLCPFVNIPCAAARAVIKNRNTTYIVIGAVQPNDIS NKICLDLNVSCISKELDSYLGEGSCGIPFYCEELLKNLEHHEVLVFQQTESEEKTNRTWN NLFKYSIKLTEKLNMVTLHSDKESEEVCHLTSGVRLKNLSPPTSLKEISLIQLDSMRLSH QMLVRCAAIIGLTFTTELLFEILPCWNMKMMIKTLATLVESNIFYCFRNGKELQKALKQN DPSFEVHYRSLSLKPSEGMDHGEEEQLRELENEVIECHRIRFCNPMMQKTAYELWLKDQR KAMHLKCARFLEEDAHRCDHCRGRDFIPYHHFTVNIRLNALDMDAIKKMAMSHGFKTEEK LILSNSEIPETSAFFPENRSPEEIREKILNFFDHVLTKMKTSDEDIIPLESCQCEEILEI VILPLAHHFLALGENDKALYYFLEIASAYLIFCDNYMAYMYLNEGQKLLKTLKKDKSWSQ TFESATFYSLKGEVCFNMGQIVLAKKMLRKALKLLNRIFPYNLISLFLHIHVEKNRHFHY VNRQAQESPPPGKKRLAQLYRQTVCLSLLWRIYSYSYLFHCKYYAHLAVMMQMNTALETQ NCFQIIKAYLDYSLYHHLAGYKGVWFKYEVMAMEHIFNLPLKGEGIEIVAYVAETLVFNK LIMGHLDLAIELGSRALQMWALLQNPNRHYQSLCRLSRCLLLNSRYPQLIQVLGRLWELS VTQEHIFSKAFFYFVCLDILLYSGFVYRTFEECLEFIHQYENNRILKFHSGLLLGLYSSV AIWYARLQEWDNFYKFSNRAKNLLPRRTMTLTYYDGISRYMEGQVLHLQKQIKEQSENAQ
ASGEELLKNLENLVAQNTTGPVFCPRLYHLMAYVCILMGDGQKCGLFLNTALRLSETQGN ILEKCWLNMNKTGSLMNNRNLLLSFGSWKSKIKALEIRCLLRRDNDHRERAMMTMVVLSN RKGGNVGKRKRDQIVIVSV

>gi568815597f:167622152_167887918|GENSCAN_predicted_CDS_2|3840_bp
attgataacaaacagcttcagggctatttatctgagcttcgcccagtgacgattgtgttt gtgaacctgatgtttgaagaccaagacaaagcagaagagataggcccagccatccaggat gcctatatgcacatcacttctgtcctgaagatcttccaaggccaaatcaataaagtcttc atgtttgacaagggctgctctttcctctgtgtctttggcttccctggggaaaaggtacct gacgagctcactcatgctctggaatgtgctatggatatatttgacttctgctctcaagtc cacaaaatccaaactgtatccatcggtgttgccagtgggattgtcttctgtgggatcgtt ggacacactgtgagacacgagtacacagtcattggtcaaaaagtcaacttagctgccagg atgatgatgtactacccaggaattgtgacctgcgactctgtcacctacaatgggagcaac ctaccagcgtacttttttaaagagcttccaaagaaagttatgaaaggtgttgcagattct ggaccattgtatcagtattggggccgtactgagaaagtgattattgccatttcattgaat aagatcagcttccatcaaactttctataccatccagatgttcatggccaatgtcctaggc ctagacacttgtaaacattataaagaacgacagaccaaccttcgaaataaagtcatgaca ctgttggatgaaaagttctactgtcttcttaatgacattttccatgttcagttccctatt tctcgggagatttccaggatgagcaccttgaaaaagcaaaaacaattggaaatattgttt atgaagatcttgaagctgatagtgaaagaggaaaggattatttttatcattgatgaggcc cagtttgtggattcgacctcctggagatttatggagaagcttatccggactcttcctatc ttcatcattatgtccctgtgtcccttcgttaacattccctgtgcagctgccagggccgta ataaagaacaggaacaccacctacattgtcattggtgcagtacagcctaacgacatctcc aacaagatctgtcttgacctcaatgtgagctgcatctccaaagaactggactcgtacctg ggggagggaagctgtgggattccattttactgtgaagaattgcttaaaaacctggaacat catgaggtactcgttttccaacaaacggagtctgaggaaaagacaaataggacctggaat aacctgttcaagtattccattaagctaacagagaagttaaacatggttactctccatagt gataaggaaagtgaagaagtctgtcacctcacaagtggtgtcagactgaaaaacctgtca cctccaacgtcattaaaagaaatctctctgatccagctggatagcatgagactttcccac caaatgctggtgagatgtgctgccatcattggcctgaccttcaccactgagttgttgttt gagattctcccctgttggaatatgaagatgatgatcaagaccctggcaaccctagtggaa tctaacattttttattgtttccggaatggcaaggagcttcaaaaggccctgaaacagaat gatccctcatttgaggtgcactatcgttccttgtctctgaagcccagtgaagggatggat cacggtgaagaggaacagcttcgtgaactggagaatgaggtgatcgagtgccacaggatt
cgattctgtaaccctatgatgcagaaaacagcctacgagctgtggctcaaggaccagaga aaagccatgcacttgaaatgtgcccgctttttagaagaagatgcccacagatgtgaccac tgccgaggcagggacttcattccctatcatcacttcacagtgaatattcggctcaacgct ttagacatggatgccattaaaaagatggctatgtctcatggatttaaaactgaagaaaag cttatcttgtccaactcagagattcctgagacatctgcattttttcctgaaaatcgcagt cctgaagaaataagagaaaagatcttgaatttctttgaccacgttttaacaaaaatgaag acatctgacgaagacattatccctctggaatcttgccagtgtgaagaaatcctagagatt gtcatcttgcctctggcccaccattttctggctttgggagaaaatgacaaagccttatat tacttcttagaaattgcatctgcttatctcatcttttgtgataactacatggcatacatg tatttgaatgaaggacagaagttgctaaaaactctcaagaaggacaaatcttggagccag acatttgagtctgccaccttttacagcctcaaaggtgaggtctgtttcaatatgggccag atagtgcttgccaagaaaatgctgaggaaggcactgaagctcctcaaccgaatctttcct tacaacttaatctccttgtttctccatatccatgtcgagaaaaacagacactttcattat gtgaatcggcaggcccaagagagcccacctccagggaagaagaggctggcacaactttac cggcaaactgtctgcctttccttgctgtggcgcatctatagctacagttatctttttcac tgcaagtattatgcccacctggcagttatgatgcaaatgaatactgcactggaaactcaa aattgtttccagatcattaaggcttacctagactattcgctataccaccacctggctggc tacaaaggtgtgtggttcaaatatgaagtcatggccatggagcacatcttcaacctcccc ctgaaaggcgagggcattgaaatcgtggcatacgtggctgagacactggtcttcaacaag ctcataatgggacacctggatttggccattgagttaggctcccgagcccttcagatgtgg gcactgctccagaatcccaaccgacattatcagtccctctgcagacttagcagatgtctc cttctgaacagcagatacccgcaattgatccaggtgctggggcggctgtgggagctttct gtaacacaggaacacatcttcagcaaggcatttttctattttgtctgcttggacatcctg ctttattctggttttgtttatagaacatttgaagaatgtttggaattcatacaccaatac gaaaacaacagaatcctcaagttccacagtggactcctcctgggactttattcctctgta gctatctggtatgccagacttcaggaatgggacaacttttacaaattttccaatagagct aaaaatcttttgccaagaagaaccatgacacttacttactatgacggaatatctaggtac atggaggggcaagttcttcaccttcaaaaacaaatcaaagaacagtcagagaatgcccaa gccagtggggaggagctactcaagaacttggagaatctggtggctcaaaataccactggc cctgtcttttgcccaaggctctaccacctgatggcttacgtctgtatattaatgggagat gggcagaaatgtggcctcttcctgaacacagccttgcggctctctgaaacacaggggaat atactggagaaatgctggctgaacatgaacaaaactgggtcattgatgaacaatagaaac 
ctactgctcagttttggaagctggaagtccaagatcaaggctctggagattcgctgtctg ctccgaagagacaacgaccatcgagaacgggccatgatgacgatggtggttttgtcgaat agaaaagggggaaatgtggggaaaagaaagagagatcagattgttattgtgtctgtgtag