GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:43:21 Sequence gi568815597f:148648953_148849495 : 200543 bp : 40.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7915 8054 140 0 2 86 115 88 0.659 10.96 1.02 Term + 14444 14528 85 2 1 70 40 81 0.368 -2.35 1.03 PlyA + 15220 15225 6 1.05 2.06 PlyA - 15832 15827 6 1.05 2.05 Term - 20557 20370 188 2 2 58 55 71 0.247 -2.53 2.04 Intr - 25748 25636 113 1 2 81 101 63 0.561 6.00 2.03 Intr - 30239 29994 246 1 0 80 65 93 0.080 1.85 2.02 Intr - 30612 30510 103 1 1 46 101 211 0.005 16.51 2.01 Init - 41125 40990 136 1 1 76 57 115 0.011 7.55 2.00 Prom - 44407 44368 40 -8.55 3.00 Prom + 46829 46868 40 -6.55 3.01 Init + 49185 49381 197 2 2 67 64 128 0.591 6.75 3.02 Intr + 56533 56639 107 1 2 107 63 40 0.290 2.34 3.03 Intr + 63548 63670 123 0 0 50 92 61 0.190 2.34 3.04 Intr + 66393 66520 128 1 2 102 61 66 0.439 4.78 3.05 Intr + 72941 73078 138 1 0 58 98 30 0.271 0.74 3.06 Intr + 73608 73769 162 2 0 83 74 92 0.206 6.55 3.07 Intr + 82055 82206 152 1 2 81 79 35 0.225 -0.06 3.08 Intr + 83442 83603 162 0 0 105 100 58 0.956 6.87 3.09 Intr + 88177 88280 104 1 2 122 60 20 0.621 1.50 3.10 Term + 99882 100546 665 1 2 41 44 573 0.832 41.24 3.11 PlyA + 103122 103127 6 1.05 4.08 PlyA - 104138 104133 6 1.05 4.07 Term - 115226 114880 347 0 2 6 42 229 0.206 3.77 4.06 Intr - 117216 117086 131 2 2 77 94 46 0.412 3.52 4.05 Intr - 123842 123713 130 0 1 115 31 83 0.553 4.13 4.04 Intr - 127204 127058 147 2 0 48 70 204 0.992 13.89 4.03 Intr - 130049 129889 161 1 2 80 111 137 0.952 13.91 4.02 Intr - 143092 142923 170 1 2 12 46 132 0.060 -0.78 4.01 Init - 145925 145923 3 2 0 108 81 0 0.108 1.35 4.00 Prom - 149268 149229 40 -6.65 5.00 Prom + 150343 150382 40 -5.35 5.01 Init + 158063 158104 42 1 0 82 109 0 0.627 1.97 5.02 Intr + 159406 159785 380 0 2 -53 78 412 0.878 18.94 5.03 Intr + 173453 173620 168 2 0 44 41 129 0.009 1.94 5.04 Intr + 175844 175965 122 2 2 55 73 110 0.146 5.42 5.05 Term + 177248 177270 23 0 2 81 43 53 0.245 -2.40 5.06 PlyA + 177397 177402 6 1.05 6.02 PlyA - 178845 178840 6 1.05 6.01 Sngl - 185230 184217 1014 1 0 44 46 422 0.543 30.36 6.00 Prom - 185488 185449 40 -14.62 7.06 PlyA - 185726 185721 6 1.05 7.05 Term - 186537 185737 801 0 0 68 42 629 0.180 48.55 7.04 Intr - 187322 187165 158 0 2 51 33 102 0.043 -0.19 7.03 Intr - 192429 192284 146 2 2 56 106 20 0.061 -0.39 7.02 Intr - 192982 192870 113 1 2 83 37 143 0.062 6.96 7.01 Init - 196014 195583 432 0 0 77 2 295 0.688 16.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 30582 30510 73 0 1 77 101 237 0.898 23.18 S.002 Term - 147960 147872 89 1 2 91 42 107 0.826 3.24 S.003 Term + 160355 160451 97 2 1 122 44 68 0.911 2.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_1|74_aa MLKIYSLAPDPNQQYTNIKEKVTETKITHLMKIYDLAKENEAENIHSSEIRHRPLRMPDQ ISFPNAENQLLLTF >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_1|225_bp atgctgaaaatctacagcctggcccctgatccaaatcagcaatatactaatattaaggaa aaagtaacagaaaccaaaatcacccacctaatgaagatatatgatctagcaaaagaaaat gaggctgagaacatccacagctcagaaatccgacataggccacttcgcatgcctgatcag ataagctttcccaatgctgaaaaccaactgcttctaaccttttaa >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_2|261_aa MVRAREGFAGEVIFELGDVFSEVGSEAFCWEENGEQKEGKRLQQRGGGGGGGDREDARPA PRSAVGAAGALAVLRDPRAWREAGSKSQKLLFRSARVQGGGQFCPSGSAFLGVEREPTAG LGGAERRRARFWRGERGQGRQAKRPAPSQPASPLPGGGTWAGTCLACAPNSESTERQVLV TASVTPAFPPLEHSIGEADGTPLVLTFVAPAELLYAEALQVSSVWFRVISGHMNAFLSII GIYWAGCLPAITIFSMKRPGE >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_2|786_bp atggtgagggctagggaaggttttgcaggggaggtgatatttgaactaggagatgtcttt agtgaagtaggaagtgaagccttttgctgggaagaaaatggggaacagaaagaaggaaag aggctacagcagagaggcggcggcggcggaggaggcgaccgagaagatgcccgccctgcg ccccgctctgctgtgggcgctgctggcgctctggctgtgctgcgcgacccccgcgcatgg agggaggccggcagcaagtctcagaaactcctttttcgtagtgccagggtgcagggaggt gggcagttttgcccttcaggttccgcgtttcttggggtcgagcgagagccgacggcgggc ctcggaggggctgagcgaaggcgtgccagattctggcgtggagagcgggggcagggccgc caagccaaacggcctgcaccttcgcagccagcctcgcctttgccagggggcggcacatgg gccggaacttgtctggcatgtgccccaaattcagagtcaactgagcggcaggtcttagtc acagcatctgtgactcctgctttcccacctttagaacattctataggagaggctgatggc accccgctggttttgacattcgttgctccagctgaactgctctatgctgaagccctgcag gtctcaagtgtttggttcagagtaatttcaggccatatgaatgctttcttaagtatcatt ggaatttactgggctggctgtctgccagctattaccatatttagcatgaagagacctggg gaatga >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_3|645_aa MGRLDYSDEPDVITGVLIEKGGRSVRERDVTMRDHQPKPKTNKQKQKLWLEAGRGKEQCL SSSSRRHCHAPGGVGGDASNPLGFDLAERGLSFQLLGLFLKKMRDGLPGHSYTGYVNTVP ATPNFLFEMKVPHPQPPALRSRGIMKCGIKLLLGQGGPLHSVGSTSLNSVMLNEARFQLC RAGMQPCLNLGGGAQEIIQAPLPRAAQGRRPGLQGPPQSLSPRNMILTPPKVASSCSALD KRAGHCCWGDISRQARESCRLLSAYLLAGKTNIGQLFSKGGMQSTEDPLTYYLCQALYSS QQCWKIGITILKLQMRKLRSGDLSVSEIPNPRGMDRLPEGSSETCVRKEKLSGLSSLLHD HSFSPTYHCLAWNGSWKEDSRETIETDNRKDAVISELSLEHFSPVCQGFREGFPNIQAGC QGFLTLPGAPAPDSPSPPLPGGVAAAGPPRRRPEQQQQQQEPASMMKFKPNQTRTYDREG FKKRAACLCFRSEQEDEVLLVSSSRYPDQWIVPGGGMEPEEEPGGAAVREVYEEAGVKGK LGRLLGIFEQNQDRKHRTYVYVLTVTEILEDWEDSVNIGRKREWFKVEDAIKVLQCHKPV HAEYLEKLKLGCSPANGNSTVPSLPDNNALFVTAAQTSGLPSSVR >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_3|1938_bp atgggaagattagattattcagacgaacctgatgtaatcacaggggtccttatagaaaaa ggaggcaggagtgtcagagaaagagatgtgacaatgagggaccatcagccaaaaccaaaa acaaacaaacaaaaacaaaaactgtggttagaagctggaagaggcaaagaacagtgtctc tcctccagctccaggagacattgtcatgctccaggtggggtgggtggtgatgccagtaat cccctgggctttgacttagctgagcgtggactcagttttcagttgcttggcttgttcttg aagaaaatgagagatgggctcccagggcactcctatactggatatgtcaacacagtgcct gccacacccaacttcctctttgagatgaaagtgccccacccacagccccctgctttgaga agcagagggataatgaagtgtggcataaagcttctcctgggtcagggtggcccactccac agtgttggcagcaccagtctcaacagtgtgatgctgaatgaagctaggttccagctgtgt agagctggcatgcagccctgcctgaacctcggaggaggcgcccaggagatcatccaggca ccgctgcctagggccgcgcagggaagacggccagggctccagggtccgcctcagtcactg tcaccccgcaacatgattctgactcccccaaaggtggccagcagctgctccgcacttgac aaacgtgctggacactgctgctggggagacatcagccggcaagcaagggaaagttgcaga cttctcagtgcttacctcctagctgggaagacaaatattggacaactgttctctaaagga ggcatgcagagtactgaagacccactgacctattatttgtgtcaggcactttattcctca caacaatgctggaagattggtataactattctcaaattacagatgaggaaactgaggtct ggggatctgtctgtgtcggagatccccaacccccggggcatggacaggcttcctgagggc tcctcagaaacatgtgttagaaaagaaaagctttccggtctgtccagtctcctccatgac cacagtttctcacccacataccattgtctggcttggaatggatcctggaaggaggactca agagaaaccatagaaacagacaacaggaaagatgctgtcatttctgagttatctttggag catttctctccagtgtgtcagggtttcagagaaggctttcctaatattcaggctggctgt cagggtttcctaacgctcccgggcgctcctgcgcccgactcgccctcgcccccactcccc ggcggggtggcggcggccgggcccccacggcggcggccggagcagcagcagcagcagcag gagcccgcctctatgatgaagttcaagcccaaccagacgcggacctacgaccgcgagggc ttcaagaagcgggcggcgtgcctgtgcttccggagcgagcaggaggacgaggtgctgctg gtgagtagcagccggtacccagaccagtggattgtcccaggaggaggaatggaacccgag gaggaacctggcggtgctgccgtgagggaagtttatgaggaggctggagtcaaaggaaaa ctaggcagacttctgggcatatttgagcagaaccaagaccgaaagcacagaacatatgtt tatgttctaacagtcactgaaatattagaagattgggaagattctgttaatattggaagg aagagagagtggttcaaagtagaagatgctatcaaagttctccagtgtcataaacctgta catgcagagtatctggaaaagctaaagctgggttgttccccagccaatggaaattctaca gtcccttcccttccggataataatgccttgtttgtaaccgctgcacagacctctgggttg ccatctagtgtaagatag >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_4|362_aa MRLSRDFQSAPCSFRLIPPSPLTLHFHFSDWMHCAEDTAKGCENSRIKKSPAFTQLALYI IEQGVCDLVLCEAAFPKTLAFAYLEDLHSEFDEQHGKKVPTVSRPYSFIEFDTFIQKTKK LYIDSCARRNLGSINTELQDVQRIMVANIEEVLQRGEALSALDSKANNLSSLSKKYRQDA KYLNMHSTYAKLAAVAVFFIMLIVNEATWLSPLYATFRPPRASKKTKAEKPIQETATSKV EATSDHTERLLKSQKAIDAGVVAEKMEHLYSANGNVNSFSRCAKHFGDFSKNRQQLPLDP AIPLLDIYLKEYRLFCHKDTCTHMFIAALFTIAKTWNQPKCPSVVVWIKKMWYTYTVEYT QP >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_4|1089_bp atgaggctttcacgggacttccaaagcgcaccttgttctttccggctcattcccccttcc cctctaacccttcactttcacttctctgactggatgcactgtgcggaggacacagctaag ggctgtgaaaattcaagaataaagaaaagtcctgccttcacgcagctcgcactctacatt attgagcagggggtgtgtgatttggttttatgtgaagctgccttccctaagacgttggct tttgcctacctagaagatttgcactcagaatttgatgaacagcatggaaagaaggtgccc actgtgtcccgaccctattcctttattgaatttgatactttcattcagaaaaccaagaag ctctacattgacagttgtgctcgaagaaacctaggctccatcaacactgaattgcaagat gtgcagaggatcatggtggccaatatcgaagaagtgttacaacgaggagaagcactctca gcattggattcaaaggctaacaatttgtccagtctgtccaagaaataccgccaggatgcg aagtacttgaacatgcattccacttatgccaaacttgcagcagtagctgtatttttcatc atgttaatagtaaatgaggccacttggctgagcccattatatgcaacattcagaccccca agggcatcaaagaagacaaaagcagaaaaacctatccaagaaacagcaacttcaaaggtt gaagcaacatcagaccacacagaacggctattaaaaagtcaaaaagcaatagatgccggc gtggttgcagagaaaatggaacacttatatagtgctaatgggaatgtaaattcgttcagc cgttgtgcaaagcactttggtgatttctcaaagaaccggcaacaattaccactcgaccca gcgatcccattattggatatatacctaaaggaatatagattgttctgccataaagacaca tgcacgcatatgttcatcgcagcactattcacgatagcaaagacatggaatcaacctaaa tgcccatcagtggtagtctggataaagaaaatgtggtacacgtacaccgtggaatataca cagccataa >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_5|244_aa MKLKLGGLKERMGQVAEAAAEEAQGTDGDAQPGPEDAPWQSRKTPGKQAGTVARTGVARP RKSMKGTDSGSCCRRRCDFGCCCRASRRAHYTPYRSGDATRTPQSPRQTPSRERRRPEPA GSWAAAAEEEEAAAAATPWMRPAAAFNSHRNGNPIVNWAYEGSRLPASYEDLIPDDHSKS IPHPVHGKVVFLENGSCHPFNHRWKTLLAVTGPGDFSVFLRALKAKVHIYGSSVLPEECT EASK >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_5|735_bp atgaagcttaagttaggaggcttaaaagaaagaatgggccaggtggcagaggcagccgcg gaggaggcccagggaactgacggggatgctcagccaggcccagaggatgcgccctggcag tcccggaaaacaccaggaaaacaagcaggaaccgtagctaggactggggtggccaggccc aggaaatccatgaagggcacagacagcgggtcctgctgccgccgccgatgcgactttggc tgctgctgtcgcgcgtcccgccgggctcactacacgccttaccggtccggggacgcgacg cgaacccctcagtccccacggcagaccccgagccgggaaagacggcgcccggagccagcc gggagctgggcagcagcggccgaggaggaagaagcagctgcggcggccacaccctggatg agaccagcagctgcattcaattctcataggaacgggaaccctattgtgaactgggcatat gagggatctaggttgcctgcttcttacgaggatctaatacctgatgatcattccaaatcc atcccccaccctgtccatggaaaagttgtcttccttgaaaatggttcctgtcatcccttc aaccaccgctggaagacattgctggcagtgactggccctggggacttcagtgtcttcttg cgtgccttaaaggctaaggttcacatctatggctccagtgtcctacccgaggaatgtacc gaagccagcaaataa >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_6|337_aa MVKGSIQQEELTILNIYAPNTGAPRLIKQVLRDLERDLDSHTLIMGDFNTPLSTVDRSTR QKVNKDIQELNSPLHQVDLIDIYRTLHHKSTEYTFFSAPHRTYSKIDHIVGSKALLSKCK RTEIIINCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKTEMKMFFETNE NKDTMYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIQKKREKNQIDAIKNDKG DITTDPTEIQTNIREYYKHLYANKLENLEEMDKFLDT >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_6|1014_bp atggtaaagggatcaattcaacaagaagagctaactatcctaaatatatatgcacccaat acaggagcacccagattgataaagcaagtccttagagacctagaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacagtagacagatcaacgaga cagaaagtcaacaaggatatccaggaattgaactcacctctgcaccaagtggacctaata gacatctacagaactctccatcacaaatcaacagaatatacattcttctcagcaccacac cgcacttattccaaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaa agaacagaaattataataaactgtctctcagaccacagtgcaatcaaactagaactcagg attaagaaactcactcaaaaccgctcaactacatggaaactgaataacctgctcctgaat gactactgggtacataatgaaatgaagacagaaatgaagatgttctttgaaaccaacgag aacaaagacacaatgtaccagaatctctgggacacattcaaagcagtgtgtagaggaaaa tttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaaca tcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaagatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaagatcaacaaaattgatagaccgctagca agactaatacagaagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaacatcagagaatactataaacacctc tacgcaaataaactagaaaatctagaagaaatggataaattcctcgacacatga >gi568815597f:148648953_148849495|GENSCAN_predicted_peptide_7|549_aa MTKRGGLGTRQGEATEKGSCHGTLVVRGDLSLPKGAGGGEGGGPPSLERGEKETAHLPRP GLLHFPGSFLSLRTLLTREGKQPGKMRSLAHESPAQKAAAFSSPAPFCVGGEERALLPAF LENGFDVAEPAASRATRTREGLFSPCEDTASVATRSHLGSREQALPDTESASTLILVQPS ELIYSLGMDWFKSVIFMAECRSARELAETHDDWSLLRTGTVSLLPFSVHQILANGTPGDY IPCMARGGPTPIEPPSLLAQQTEIELQGGSEAGGGAPAIAEAGERSSSPAMEQSWMENDF DKLREEGFRQSNYSELKEEVRTHGKEVKNLEKKLDQWLTRITNTEKSLKDLMELKTKARE LRDKCTSLSSRFDQLEERVSVMEDEMNEMKQEEKFKEKRIKRNEQSLQEIWGYVKRPNLR LIGVPESDRENGTKLENTLQDNNQENLPNLARQANIQIQEIQRMPQRYSLRRATPRQIIV RFTEVEMKEKMLRAAREKGRVTLKGKPIRLTADLLAETLQARREWGLIFNILKEKNFQPE FHIQPNEAS >gi568815597f:148648953_148849495|GENSCAN_predicted_CDS_7|1650_bp atgaccaagagaggtgggctgggaaccaggcagggcgaggctacagaaaagggaagctgt cacgggaccctggtggtgaggggcgacctctctctccctaagggcgctgggggaggggag ggaggaggaccaccaagtctggagcgcggcgagaaggaaacagcccacctacctcgtcca ggtctgctccattttccaggctctttccttagtctcaggacgctcctcacccgggagggg aagcagcctgggaaaatgagaagccttgcccacgaatctccagcgcaaaaggcagcagct ttttcctccccagctcctttctgcgtcggcggcgaagagagagctctgctccctgctttt ttagaaaatggatttgacgtggccgaacctgcggctagccgtgcgacccgcacaagggag ggactgttctcaccatgtgaggacacagcatctgtggcaacaaggagccatcttgggagc agagagcaagctcttccagacactgaatctgccagcaccttgatcttggtccagccttca gaactgatctatagtttaggtatggattggtttaagtctgtcattttcatggcagagtgc aggagtgcaagagagttggcagagactcatgatgactggtctctgctcagaactggcaca gtctcgcttctgccattttccgttcaccaaatcttagcaaacggcacaccaggagattat atcccgtgcatggctcgggggggtcctacgcccatagagcctccctcattgctggcacag cagactgagatcgaactgcaaggcggcagcgaggctgggggaggggcacctgccattgcc gaggctggagaacgcagctcctccccagcaatggaacaaagctggatggagaatgacttt gacaagttgagggaagaaggcttcagacaatcaaactactccgagctaaaggaggaagtt cgaacccatggcaaagaagttaaaaaccttgaaaaaaaattagaccaatggctaactaga ataaccaatacagagaagtccttaaaggacctgatggagctgaaaaccaaggcacgagaa ctacgtgacaaatgcacaagcctcagtagccgattcgatcaactggaagaaagggtatca gtgatggaagatgaaatgaatgaaatgaagcaagaagagaagtttaaagaaaaaagaata aaacgaaatgaacaaagccttcaagaaatatggggctatgtgaaaagaccaaatctacgt ctgattggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacactctgcag gataataaccaggagaacctccccaatctagcaaggcaggccaacattcaaattcaggaa atacagagaatgccacaaagatactccttgagaagagcaactccaagacaaataattgtc agattcaccgaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgg gttaccctcaaagggaagcccatcagactaacagcggatctcttggcagaaactctacaa gccagaagagagtgggggctaatattcaacattcttaaagaaaagaattttcaaccagaa tttcatatccagccaaatgaagcttcataa