GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:17:34 Sequence gi568815593f:126500963_126724841 : 223879 bp : 43.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 70 65 6 1.05 1.15 Term - 7981 7770 212 2 2 54 38 102 0.088 -0.74 1.14 Intr - 25234 25081 154 1 1 71 57 78 0.117 2.65 1.13 Intr - 51175 51059 117 1 0 60 93 57 0.938 4.06 1.12 Intr - 53431 53325 107 2 2 115 116 78 0.999 13.23 1.11 Intr - 55053 54969 85 0 1 57 84 98 0.990 5.69 1.10 Intr - 58372 58278 95 2 2 29 105 55 0.909 0.98 1.09 Intr - 60162 60121 42 1 0 83 87 37 0.673 1.31 1.08 Intr - 67394 67297 98 1 2 98 111 85 0.978 11.45 1.07 Intr - 69897 69820 78 2 0 136 69 60 0.991 7.57 1.06 Intr - 74502 74458 45 2 0 60 106 42 0.622 0.72 1.05 Intr - 76249 76117 133 2 1 123 99 155 0.999 19.80 1.04 Intr - 82012 81889 124 1 1 89 80 94 0.934 8.86 1.03 Intr - 91767 91702 66 0 0 73 95 31 0.715 1.40 1.02 Intr - 92442 92389 54 0 0 84 110 46 0.967 5.58 1.01 Init - 94236 94045 192 0 0 37 85 173 0.843 9.10 1.00 Prom - 94809 94770 40 -7.16 2.00 Prom + 99252 99291 40 -7.46 2.01 Init + 99863 100096 234 1 0 41 81 361 0.995 26.84 2.02 Intr + 102608 103221 614 1 2 111 99 325 0.961 27.08 2.03 Intr + 107402 107522 121 2 1 52 95 49 0.946 2.50 2.04 Intr + 116288 116371 84 1 0 88 90 77 0.992 7.92 2.05 Term + 123613 123882 270 0 0 87 42 224 0.999 13.18 2.06 PlyA + 123931 123936 6 1.05 3.02 PlyA - 126007 126002 6 1.05 3.01 Sngl - 127357 127055 303 1 0 86 40 351 0.999 25.93 3.00 Prom - 161079 161040 40 -2.56 4.02 PlyA - 161134 161129 6 -0.45 4.01 Sngl - 162688 162425 264 1 0 88 42 185 0.929 7.11 4.00 Prom - 173190 173151 40 -2.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:126500963_126724841|GENSCAN_predicted_peptide_1|533_aa MWRLPRALCVHAAKTSKLSGPWSRPAAFMSTLLINQPQYAWLKELGLREENEGVYNGSWG GRGEVITTYCPANNEPIARVRQASVADYEETVKKAREAWKIWADVSLEMGKILVEGVGEV QEYVDICDYAVGLSRMIGGPILPSERSGHALIEQWNPVGLVGIITAFNFPVAVYGWNNAI AMICGNVCLWKGAPTTSLISVAVTKIIAKVLEDNKLPGAICSLTCGGADIGTAMAKDERV NLLSFTGSTQVGKQVGLMVQERFGRSLLELGGNNAIIAFEDADLSLVVPSALFAAVGTAG QRCTTARRLFIHESIHDEVVNRLKKAYAQIRVGNPWDPNVLYGPLHTKQAVSMFLGAVEE AKKEGGTVVYGGKVMDRPGNYVEPTIVTGLGHDASIAHTETFAPILYVFKFKLSSPDSEP KGVEEKPFLFYSRKEVGIFDILPRKRKKRRKPVGLAWSETGDVGAEEPEAIERGANRSSS SACELDLGHLGASLLREPTVWWFQGPRLLVWDHKYFLISRNALSLRSALDTHY >gi568815593f:126500963_126724841|GENSCAN_predicted_CDS_1|1602_bp atgtggcgccttcctcgcgcgctgtgtgtgcacgctgcaaagaccagcaagctctctgga ccttggagcaggcctgccgccttcatgtccactctcctcatcaatcagccccagtatgcg tggctgaaagagctggggctccgcgaggaaaacgagggcgtgtataatggaagctgggga ggccggggagaggttattacgacctattgccctgctaacaacgagccaatagcaagagtc cgacaggccagtgtggcagactatgaagaaactgtaaagaaagcaagagaagcatggaaa atctgggcagatgtgtctttggagatggggaaaatcttagtggaaggtgtgggtgaagtt caggagtatgtggatatctgtgactatgctgttggtttatcaaggatgattggaggacct atcttgccttctgaaagatctggccatgcactgattgagcagtggaatcccgtaggcctg gttggaatcatcacggcattcaatttccctgtggcagtgtatggttggaacaacgccatc gccatgatctgtggaaatgtctgcctctggaaaggagctccaaccacttccctcattagt gtggctgtcacaaagataatagccaaggttctggaggacaacaagctgcctggtgcaatt tgttccttgacttgtggtggagcagatattggcacagcaatggccaaagatgaacgagtg aacctgctgtccttcactgggagcactcaggtgggaaaacaggtgggcctgatggtgcag gagaggtttgggagaagtctgttggaacttggaggaaacaatgccattattgcctttgaa gatgcagacctcagcttagttgttccatcagctctcttcgctgctgtgggaacagctggc cagaggtgtaccactgcgaggcgactgtttatacatgaaagcatccatgatgaggttgta aacagacttaaaaaggcctatgcacagatccgagttgggaacccatgggaccctaatgtt ctctatgggccactccacaccaagcaggcagtgagcatgtttcttggagcagtggaagaa gcaaagaaagaaggtggcacagtggtctatgggggcaaggttatggatcgccctggaaat tatgtagaaccgacaattgtgacaggtcttggccacgatgcgtccattgcacacacagag acttttgctccgattctctatgtctttaaattcaagctatcatctccagactcagaacct aagggggtagaggaaaagcctttcctcttctacagcaggaaggaggttggcatatttgat attcttcccaggaagaggaaaaagaggaggaagccagtggggctggcttggagtgaaaca ggggatgtgggtgcagaagaaccagaagcaatagaaagaggagcaaatcgatcatcatct agtgcctgtgagctggatttgggtcacctgggtgcctccctgcttcgagaaccaacagtg tggtggtttcaaggacctcggcttctggtttgggaccacaaatatttcctaatctctaga aatgctctcagtttacgatctgctttagacacgcactactaa >gi568815593f:126500963_126724841|GENSCAN_predicted_peptide_2|440_aa MGRLRPLGQAGLRAGAPEQPSRRAFRLPRRKCSPDPPLCSAAHRGKMALEVGDMEDGQLS DSDSDMTVAPSDRPLQLPKVLGGDSAMRAFQNTATACAPVSHYRAVESVDSSEESFSDSD DDSCLWKRKRQKCFNPPPKPEPFQFGQSSQKPPVAGGKKINNIWGAVLQEQNQDAVATEL GILGMEGTIDRSRQSETYNYLLAKKLRKESQEHTKDLDKELDEYMHGGKKMGSKEEENGQ GHLKRKRPVKDRLGNRPEMNYKGRYEITAEDSQEKVADEISFRLQEPKKDLIARVVRIIG NKKAIELLMETAEVEQNGGLFIMNGSRRRTPGGVFLNLLKNTPSISEEQIKDIFYIENQK EYENKKAARKRRTQVLGKKMKQAIKSLNFQEDDDTSRETFASDTNEALASLDESQEGHAE AKLEAEEAIEVDHSHDLDIF >gi568815593f:126500963_126724841|GENSCAN_predicted_CDS_2|1323_bp atgggacgcctcaggccgctgggacaggctggcctccgcgcgggggcgcccgagcagccg agccgccgggccttccggctgccccgccggaagtgctctcctgacccgccgctgtgcagc gcagcgcaccgcgggaagatggcgttggaggtcggcgatatggaagatgggcagctttcc gactcggattccgacatgacggtcgcacccagcgacaggccgctgcaattgccaaaagtg ctaggtggcgacagtgctatgagggccttccagaacacggcaactgcatgtgcaccagta tcacattatcgagctgttgaaagtgtggattcaagtgaagaaagtttttctgattcagat gatgatagctgtctttggaaacgcaaacgacagaaatgttttaaccctcctcccaaacca gagccttttcagtttggccagagcagtcagaaaccacctgttgctggaggaaagaagatt aacaacatatggggtgctgtgctgcaggaacagaatcaagatgcagtggccactgaactt ggtatcttgggaatggagggcactattgacagaagcagacaatccgagacctacaattat ttgcttgccaagaaacttaggaaggaatctcaagagcatacaaaagatctagacaaggaa ctagatgaatatatgcatggtggcaaaaaaatgggatcaaaggaagaggaaaatgggcaa ggtcatctcaaaaggaaacgacctgtcaaagacaggctagggaacagaccagaaatgaac tataaaggtcgatacgagatcacagcggaagattctcaagagaaagtggctgatgaaatt tcattcaggttacaggaaccaaagaaagacctgatagcccgagtagtgaggattattggt aacaaaaaggcaattgaacttctgatggaaaccgctgaagttgaacaaaatggtggtctc tttataatgaatggtagtcgaagaagaacaccaggtggagtttttctgaatctcttgaaa aacactcctagtatcagcgaggaacaaattaaggacattttctacattgaaaaccaaaag gaatatgaaaataaaaaagctgctaggaagaggagaacacaagtgttggggaaaaagatg aaacaagctattaaaagtctaaattttcaagaagatgatgatacatcacgagaaactttt gcaagtgacacgaatgaggccttggcctctcttgatgagtcacaggaaggacatgcagaa gccaagttggaggcagaggaagccattgaagttgatcattctcatgatttggacatcttt taa >gi568815593f:126500963_126724841|GENSCAN_predicted_peptide_3|100_aa MASLSELAYIYSALILHNDKINALIKAAGVNVEPFWPGVFAKALANVSIGNPICNVRAGG LAAAGDPTPSTAAASVEKKVEAKKEESEESDEDMGFGLFD >gi568815593f:126500963_126724841|GENSCAN_predicted_CDS_3|303_bp atggcctccctctccgagctcgcctacatctactcggccctcattctgcataatgataaa atcaatgccctcattaaagcagctggtgtaaatgttgaacctttctggcctggcgtgttt gcaaaggccctagccaatgtcagcatcgggaaccccatctgcaatgtaagggctggtgga cttgcagcagctggagatcctaccccctccactgctgctgcttcagttgagaagaaagtg gaagcaaagaaagaagaatccgaggaatctgatgaggacatgggctttggtctttttgac taa >gi568815593f:126500963_126724841|GENSCAN_predicted_peptide_4|87_aa MAAWSPAAAAPLLRGICGLPLHHGMFATQTEGELRVTQILKEKFPRATAIKVTDISGVVG RCMKLKLNQKNLRRRELSSSTRWFIRH >gi568815593f:126500963_126724841|GENSCAN_predicted_CDS_4|264_bp atggcggcatggagcccggccgcagcagcgcctctgctccgcgggatctgcgggcttcca cttcaccatgggatgtttgccacccagactgagggggagctcagagtgacccaaattctc aaagaaaagtttccacgagctacagctatcaaagtcactgacatttctggagttgtggga cgatgtatgaaattaaaattgaatcagaagaatttaaggagaagagaactgtccagcagc accagatggtttatcaggcactaa