GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:13:05 Sequence gi568815590r:102550135_102752398 : 202264 bp : 46.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1594 1913 320 0 2 82 37 98 0.086 0.91 1.02 Term + 10318 10750 433 1 1 115 39 280 0.983 20.57 1.03 PlyA + 13453 13458 6 1.05 2.00 Prom + 20590 20629 40 -2.36 2.01 Init + 26004 26030 27 2 0 70 91 12 0.120 -0.54 2.02 Term + 37806 37943 138 2 0 109 53 87 0.711 5.26 2.03 PlyA + 38439 38444 6 1.05 3.00 Prom + 66758 66797 40 -2.16 3.01 Init + 72561 72629 69 2 0 63 77 77 0.169 5.15 3.02 Term + 77503 77610 108 0 0 67 55 76 0.111 0.71 3.03 PlyA + 79092 79097 6 1.05 4.06 PlyA - 82112 82107 6 1.05 4.05 Term - 84246 84163 84 0 0 132 48 42 0.179 2.25 4.04 Intr - 100257 100052 206 1 2 104 55 120 0.178 9.22 4.03 Intr - 101860 101015 846 2 0 23 81 337 0.273 17.85 4.02 Intr - 105619 105432 188 0 2 40 115 176 0.900 14.83 4.01 Init - 110754 110567 188 0 2 86 100 53 0.436 4.83 4.00 Prom - 116279 116240 40 -5.26 5.00 Prom + 138092 138131 40 -0.66 5.01 Init + 138853 138922 70 0 1 96 78 49 0.771 5.92 5.02 Intr + 162381 162466 86 1 2 81 40 59 0.517 -0.06 5.03 Intr + 162771 162866 96 2 0 81 100 55 0.792 6.21 5.04 Term + 165112 165228 117 0 0 108 43 29 0.550 -1.06 5.05 PlyA + 166020 166025 6 1.05 6.04 PlyA - 166072 166067 6 1.05 6.03 Term - 166335 166213 123 0 0 27 41 132 0.183 0.78 6.02 Intr - 166587 166537 51 0 0 39 131 28 0.137 1.30 6.01 Init - 183317 183258 60 2 0 82 73 158 0.471 12.97 6.00 Prom - 184164 184125 40 -7.26 7.06 PlyA - 185622 185617 6 1.05 7.05 Term - 187034 187027 8 0 2 130 44 0 0.138 -2.17 7.04 Intr - 188275 188110 166 1 1 96 45 58 0.147 1.83 7.03 Intr - 189561 189425 137 1 2 55 58 86 0.103 2.59 7.02 Intr - 197597 197464 134 1 2 115 103 -2 0.274 4.29 7.01 Init - 200328 200294 35 0 2 71 65 52 0.142 0.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_1|250_aa MAALSCLLDSVRRDIKKVDRELRQLRCIDEFSTRCLCDLYMHPYCCCDLHPYPYCLCYSK RSRSCGLCDLYPCCLCDYKLYCLRPSLRSLERKAIRAIEDEKRELAKLRRTTNRILASSC CSSNILGSVNVCGFEPDQVKVRVKDGKVCVSAERENRYDCLGSKKYSYMNICKEFSLPPC VDEKDVTYSYGLGSCVKIESPCYPCTSPCSPCSPCSPCNPCSPCNPCSPYDPCNPCYPCG SRFSCRKMIL >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_1|753_bp atggctgcactgagttgtctcttggacagtgtcagaagggacataaagaaggtggacaga gaactaaggcaactgagatgcatcgacgaatttagcacacggtgcctgtgcgacttgtat atgcacccctattgctgctgtgacttgcacccatatccgtactgcttgtgctattccaag cgatcacgctcttgcggcctgtgtgatctctacccatgttgcctgtgtgattataagctt tactgtctgcgaccatctctcagaagtttggagaggaaagccatcagagccatagaagat gagaagcgagagcttgccaaactgagaagaacaacaaatagaattctggcttcctcctgc tgtagcagtaacattttaggatcggtgaatgtatgcggttttgaacccgatcaagtcaaa gttcgagtgaaggatggaaaggtatgtgtgtcggctgagcgggagaacaggtacgactgc cttggatcgaaaaagtacagctacatgaacatctgcaaagagttcagcttgccgccctgt gtggatgagaaggatgtaacatactcctatgggctcggcagctgtgtcaagatcgagtct ccttgctacccttgcacttctccttgcagcccctgcagcccctgcagcccctgcaacccc tgcagcccctgcaacccgtgcagcccatatgatccttgcaacccgtgttatccctgtgga agccgattttcctgtaggaagatgattttgtaa >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_2|54_aa MGTRNALVMDSPSKNFYKRDKMLKGLEEAKVNFVWRTNEGFLEEVALDLAHPDL >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_2|165_bp atgggcactaggaatgcactggtgatggactcaccctccaaaaatttctataaaagggat aaaatgctcaagggtttagaggaggctaaggttaattttgtctggagaaccaatgagggc ttcctggaggaggtagcattggatctagctcatccagatctctga >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_3|58_aa MSGTLAGVKSFRPRAAIQKLLTLLTPQVVTVKHADSQRLHLTKFFPEGFEARAVSLPG >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_3|177_bp atgtctgggacactggctggtgtaaagagcttcagaccaagagctgccatccagaagctc ctcaccttgctgacaccccaggtggtcaccgtgaagcatgcagattctcagaggctgcat ctcaccaagttcttccccgaagggtttgaggccagggcagtgtctctcccaggctga >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_4|503_aa MVEGEEAQSTSKSRENHLIKPSDLMRTYSLSREQHGGNCPDDPVAFHLIPNSTSGDYGDY NSRLSGYLNGEGADGAERGGGGSGVECLRAPVCGQAASSLAASQLAAGGQAANHAQLRCL SPADCAPSTVHFKSLSDTAKPHIAAPFKEEEKSPVSAPKLPKAQATSVIRHTADAQLCNH QTCPMKAASILNYQNNSFRRRTHLNVEAARKNIPCAAVSPNRSKCERNTVADVDEKASAA LYDFSVPSSETVICRSQPAPVSPQQKSVLVSPPAVSAGGVPPMPVICQMVPLPANNPVVT TVVPSTPPSQPPAVCPPVVFMGTQVPKGAVMFVVPQPVVQSSKPPVVSPNGTRLSPIAPA PGFSPSAAKVTPQIDSSRIRSHICSHPGCGKTYFKSSHLKAHTRTHTGEKPFSCSWKGCE RRFARSDELSRHRRTHTGEKKFACPMCDRRFMRSDHLTKHARRHLSAKKLPNWQMEASAG TDWPDLGGVGETAHEWGYPQDAG >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_4|1512_bp atggtggaaggggaagaggcacagagcacaagcaagagcagggaaaaccaccttataaaa ccatcagatctcatgagaacttactcactatcacgagaacagcatgggggaaactgcccc gatgatccagtcgccttccacctgatccccaactcgacaagtggggattatggggattac aattcaaggctgagcggctacctgaatggggagggggcagacggcgctgagcgcggcggc ggcgggagcggcgtcgagtgtctccgtgcgcccgtctgtggccaagcagccagcagccta gcagccagtcagcttgccgccggcggccaagcagccaaccatgctcaacttcggtgcctc tctccagcagactgcgcgccatctactgtacacttcaagtcactctcagatactgccaaa cctcacattgccgcacctttcaaagaggaagaaaagagcccagtatctgcccccaaactc cccaaagctcaggcaacaagtgtgattcgtcatacagctgatgcccagctatgtaaccac cagacctgcccaatgaaagcagccagcatcctcaactatcagaacaattcttttagaaga agaacccacctaaatgttgaggctgcaagaaagaacataccatgtgccgctgtgtcacca aacagatccaaatgtgagagaaacacagtggcagatgttgatgagaaagcaagtgctgca ctttatgacttttctgtgccttcctcagagacggtcatctgcaggtctcagccagcccct gtgtccccacaacagaagtcagtgttggtctctccacctgcagtatctgcagggggagtg ccacctatgccggtcatctgccagatggttccccttcctgccaacaaccctgttgtgaca acagtcgttcccagcactcctcccagccagccaccagccgtttgcccccctgttgtgttc atgggcacacaagtccccaaaggcgctgtcatgtttgtggtaccccagcccgttgtgcag agttcaaagcctccggtggtgagcccgaatggcaccagactctctcccattgcccctgct cctgggttttccccttcagcagcaaaagtcactcctcagattgattcatcaaggataagg agtcacatctgtagccacccaggatgtggcaagacatactttaaaagttcccatctgaag gcccacacgaggacgcacacaggagaaaagcctttcagctgtagctggaaaggttgtgaa aggaggtttgcccgttctgatgaactgtccagacacaggcgaacccacacgggtgagaag aaatttgcgtgccccatgtgtgaccggcggttcatgaggagtgaccatttgaccaagcat gcccggcgccatctatcagccaagaagctaccaaactggcagatggaagcctctgctggt acagactggccagatttaggtggagtgggagagacagcccatgaatggggatacccgcaa gatgcaggctga >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_5|122_aa MASDALAEQAQQAEVAWDSGCWRGPFVVARWLPETVGIILSTSMRKEKTLDWLAAFYILV PNSKKPESSEVDPGLLKHWEVIVKTKLPFTCLRSHLITLGLKIMTSFHKLFLAPCLGIPG PR >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_5|369_bp atggcctcagacgccctggcagagcaggcccagcaagcagaggttgcctgggacagcggc tgctggagagggccctttgtggttgcaagatggctaccagagacagttggcattatcttg tccacatctatgaggaaagagaagacgcttgactggttggctgcattctacattctggtg cctaattccaagaaacctgagagttctgaggttgatcctggccttctaaagcactgggaa gtcattgtcaagacaaagctaccttttacctgtctccggtcacatctgatcacactgggc ttaaaaatcatgaccagtttccacaagctcttcctggctccttgcctgggcatcccaggc cctcggtga >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_6|77_aa MLLLLLLLLLLGCFHLGQWNSLKLTLLALGHPTAIAPECEVCYTRAEPAQCYGRLGPDQQ ESCPAEEVELVLVPECS >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_6|234_bp atgctgctgctcctgctgctgctgctgctgcttggctgttttcatctgggccagtggaat tctttgaaactcaccttgctggctttgggacaccccacggcgatcgctccggagtgtgag gtgtgctacaccagggctgaacctgcccagtgctacgggcgtctgggacctgaccaacag gagagctgcccggctgaggaagtagaactggtgctggttccagagtgttcatga >gi568815590r:102550135_102752398|GENSCAN_predicted_peptide_7|159_aa MAVESFNDEPQSPGQPYIRIGFHLGGVRTPRVLGQEGGGQVQTQAAGKVRLMERAAEGAF AVQGTEPVAMPSSDLCDVTRLAPIGIGAFDCCYRTEVLNILKPWCSHSPATSSWMEGWMD RPTPSPGFHDTALSHISRFVTAHFLLTGIAFSQLLTVEQ >gi568815590r:102550135_102752398|GENSCAN_predicted_CDS_7|480_bp atggcagttgagagcttcaatgatgagccccagagcccgggccagccttacattaggatt gggtttcaccttggtggcgtcaggacccccagagtccttggtcaggagggtggggggcag gtgcagacccaggcagcaggaaaggtgcggctcatggagcgggcagcagaaggagcattt gctgtccagggcactgagcctgtggccatgccctcatctgacctctgtgatgtcacccgt ctggcccctataggcattggggcttttgactgttgctacagaacagaagttcttaacatt ttgaagccctggtgctcccacagcccagccacctcttcctggatggagggatggatggac cgacctactccctcccctggtttccatgacactgcactttcccacatctccaggtttgtg acagctcatttcctcctcactggcatcgccttctcccagcttctcaccgtggaacagtaa