GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:55:39 Sequence gi568815575f:83408325_83609407 : 201083 bp : 36.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 164 799 636 1 0 45 43 284 0.437 15.53 1.02 PlyA + 1027 1032 6 1.05 2.00 Prom + 1195 1234 40 -6.15 2.01 Init + 2142 2304 163 2 1 70 72 96 0.747 6.15 2.02 Term + 2870 3135 266 0 2 66 38 210 0.967 8.49 2.03 PlyA + 3633 3638 6 1.05 3.00 Prom + 3749 3788 40 -5.65 3.01 Sngl + 10837 11469 633 0 0 47 55 219 0.592 10.48 3.02 PlyA + 11762 11767 6 1.05 4.04 PlyA - 14051 14046 6 -0.45 4.03 Term - 14879 14749 131 0 2 103 45 62 0.006 0.76 4.02 Intr - 16092 15971 122 2 2 67 81 42 0.008 0.62 4.01 Init - 26802 26579 224 0 2 88 72 153 0.145 11.78 4.00 Prom - 32381 32342 40 -4.55 5.04 PlyA - 34301 34296 6 1.05 5.03 Term - 43986 43828 159 0 0 98 42 181 0.966 11.46 5.02 Intr - 49119 49092 28 2 1 103 78 35 0.284 1.20 5.01 Init - 73947 73793 155 0 2 34 55 124 0.261 3.20 5.00 Prom - 76869 76830 40 -6.05 6.00 Prom + 80012 80051 40 -5.35 6.01 Sngl + 100001 101086 1086 1 0 106 42 983 0.995 92.30 6.02 PlyA + 103540 103545 6 1.05 7.00 Prom + 128453 128492 40 -5.05 7.01 Sngl + 134977 135993 1017 0 0 88 43 752 0.999 67.47 7.02 PlyA + 136011 136016 6 -4.04 8.00 Prom + 136093 136132 40 -5.25 8.01 Sngl + 136351 139281 2931 0 0 44 47 1007 0.725 84.30 8.02 PlyA + 140083 140088 6 1.05 9.00 Prom + 160118 160157 40 -3.65 9.01 Init + 170601 170670 70 2 1 67 105 78 0.493 8.66 9.02 Term + 176131 176195 65 2 2 74 47 60 0.070 -2.43 9.03 PlyA + 177398 177403 6 1.05 10.00 Prom + 181035 181074 40 -2.85 10.01 Init + 189102 189273 172 2 1 73 98 112 0.672 10.35 10.02 Term + 197065 197081 17 2 2 137 43 6 0.665 -1.58 10.03 PlyA + 197247 197252 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 161901 162158 258 2 0 65 33 162 0.852 3.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_1|211_aa MKQEEKFREKRVKRNEQSLQEIWDYVKRPNLCLIGVPESDGENGTKLENTLQDIIQENFP NGARQANIQIQEIQRMPQRYSLRKATPRHIIVRFTKVEMKEKLLRAAREKGQVTHKGKPI RLTADFSAETLQARREWGSIFNILKEKNFQPRISYPAKLSCISEGEIKSFTDEEMLRDFV TTRPALQELLKEALNMERNNWYQLLQKHAKL >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_1|636_bp atgaagcaagaagagaagtttagagaaaaaagagtgaaaagaaatgaacaaagcctccaa gaaatatgggactatgtgaaaagaccaaatctatgtctgattggtgtacctgaaagtgat ggggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttcccc aatggagcaaggcaggccaacattcaaattcaggaaatacagagaatgccacaaagatac tccttgagaaaagcaactccaagacacataattgtcagattcaccaaagttgaaatgaag gaaaaactgttaagggcagccagagagaaaggtcaggttacccacaaagggaagcccatc agactaacagctgatttctcggcagaaactctacaagccagaagagagtgggggtcaata ttcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagc tgcataagtgaaggagaaataaaatcctttacagatgaggaaatgctgagagattttgtc accaccaggcctgccttacaagagctcctgaaggaagcactaaacatggagaggaacaac tggtaccagctactgcaaaaacatgccaaattgtaa >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_2|142_aa MDKFLNTYTLPRLNQEEVESLDRPITGSEIEAIIKSLPTKKVQDRTDSQLNSTRVLEVLA RAVRQEKEKKGIQSGKEEVKLSLFADDMIVYLENPIVSAPNRLKLISNFSKVSGYKINVQ KSQAFLYTNNRQTESQKLLRRE >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_2|429_bp atggataaattcctgaacacatacaccctcccaagactaaaccaggaagaagttgaatct ctggatagaccaataacaggctctgaaattgaagcaataattaagagcctaccaacaaaa aaagtccaggacaggacggactcacagctgaattctaccagagtgttggaagttctggcc agggcagtcaggcaggagaaagaaaaaaagggtattcagtcaggaaaagaggaagtcaaa ttgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcgtgtcagcccca aatcgccttaaactgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacaagcattcttatacaccaataacagacaaacagagagccaaaaattgcttcga agagaataa >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_3|210_aa MPPVLADHDLPAESSSAVASLPARYPHCNFHQAPVTPILLCWHVCAFLALPVHVCACTLP CHCCSESAVHPFLSLPTTIAGRALTGTAPASPIPASASPLHQQCRKSETRYEEEQTLPCP QQPPLTATHRECTQTWAHQCPAPCKHHYQCDCTHSHQQRPRETAPSYIFPNLVVNSHTET SSPAGATTLPQPMNMQPAALPLLGAAGMCK >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_3|633_bp atgcctcctgtgctggcggaccatgacttaccagcagagagctcaagtgcggtggccagc ctacctgctcgttacccacactgcaactttcaccaggccccagtaacgcccatattgctt tgctggcacgtgtgtgcctttcttgccctgccagtgcacgtgtgtgcatgcaccctgccc tgccactgctgcagcgagagcgcagttcaccccttcctctccttgccaaccaccattgca ggcagagccttgacgggcacagcaccagccagccccatacccgccagtgcctcacccttg catcaacagtgccgcaaaagtgaaactaggtacgaagaagagcagaccctcccctgccct cagcaaccacctctgactgcaacacacagagagtgcacacaaacctgggcccaccagtgc cccgccccatgcaaacaccactaccagtgtgactgcacacacagtcaccagcagaggccc cgtgaaaccgcccccagctacattttccccaaccttgtggtgaactcccacactgagaca agcagcccagcaggcgctaccacccttccacagccaatgaacatgcaaccagctgcactg ccactgctgggggctgctggcatgtgcaaatga >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_4|158_aa MGRNQHKKAEHSKKQNASSPPKDHNSSPARKQNWTENEYDKLSEVGFRRWTITNSSELKE HVLTQCKEAKNLEQRLYRKHGSISFWEGLWNILLMAEGKAGAGILTWPEQQKGREGLTPH GSCQGLWFALSEAVTQIVSGTLLGIAGAGAAGMQGPVS >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_4|477_bp atggggagaaaccagcacaaaaaggctgaacattccaagaaacagaacgcctcttctcct ccaaaggatcacaactcctcgccagcaaggaaacaaaactggacggagaatgagtatgac aaattgtccgaagtaggcttcagaaggtggacaataacaaactcttctgagctaaaggag catgttctaacccaatgcaaagaagctaagaaccttgaacaaagactgtacaggaaacat ggcagcatcagcttctgggaaggcctctggaatattttacttatggcagaaggcaaagca ggagcaggcattttaacatggccagagcaacagaaaggccgagaaggcttaacaccacat ggaagctgccaaggcttatggtttgcactctctgaagcagtgactcaaattgtatctggg acccttttaggaatagctggagctggagcagctgggatgcagggaccagtgtcctga >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_5|113_aa MLVDTEKTFDKIQHPFMIKPLSKIGIKGAYLKIIKAIYDKSTANIILIREKFSRKQEVGD EWLHDAKSISGHWERESIEIVRQQIQYCPVRAEKKIRENSIDTRPQREKLNQP >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_5|342_bp atgttagtagacacagaaaaaacatttgacaagatccaacatccctttatgattaaaccc ctcagcaaaattggcatcaaaggagcatacctcaagataataaaagccatctatgacaaa tccacagccaacattatactgatcagggaaaagttctcaaggaagcaggaagttggtgat gagtggctgcatgatgccaagagcatctctgggcactgggagagggagagcatagaaatt gtgaggcagcaaattcagtactgtcctgttagagcagaaaagaaaattcgagaaaactca attgacacccgtcctcagagggagaaattaaaccagccctag >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_6|361_aa MATAASNPYSILSSTSLVHADSAGMQQGSPFRNPQKLLQSDYLQGVPSNGHPLGHHWVTS LSDGGPWSSTLATSPLDQQDVKPGREDLQLGAIIHHRSPHVAHHSPHTNHPNAWGASPAP NPSITSSGQPLNVYSQPGFTVSGMLEHGGLTPPPAAASAQSLHPVLREPPDHGELGSHHC QDHSDEETPTSDELEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQL SFKNMCKLKPLLNKWLEEADSSTGSPTSIDKIAAQGRKRKKRTSIEVSVKGVLETHFLKC PKPAAQEISSLADSLQLEKEVVRVWFCNRRQKEKRMTPPGDQQPHEVYSHTVKTDTSCHD L >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_6|1086_bp atggccacagctgcctcgaatccctacagcattctcagttccacctccctagtccatgcg gactctgcgggcatgcagcaggggagtcctttccgcaaccctcagaaacttctccaaagt gattacttgcagggagttcccagcaatgggcatcccctcgggcatcactgggtgaccagt ctgagcgacgggggcccatggtcctccacactggccaccagccccctggaccagcaggac gtgaagcccgggcgcgaagacctgcaactgggtgcgatcatccatcaccgctcgccacac gtagcccaccactcaccgcacactaaccaccccaacgcctggggggccagcccggcaccg aacccgtctatcacgtcaagcggccaacccctcaacgtgtactcgcagcctggcttcacc gtgagcggcatgctggaacacgggggactcaccccacctccagctgccgcctctgcacag agcctgcacccggtgctccgagagcccccggatcacggcgaactgggctcgcaccattgc caggatcactccgacgaggagacgccaacctctgatgagttggaacagttcgccaaacaa ttcaaacaaagaagaatcaagttgggcttcacgcaggccgacgtggggttggcgctgggc acactgtatggtaacgtgttctcgcagaccaccatctgcaggttcgaggccttgcagctg agcttcaaaaatatgtgcaagctgaagcccctgctgaacaagtggctggaggaggcggat tcgtccacagggagcccgaccagcattgacaagatcgctgcacagggccgcaagcgcaag aagcggacctccatcgaggtgagtgtcaagggcgtactggagacgcatttcctcaagtgt cccaagcctgccgcgcaggagatctcctcgctggcagacagcctccagttggagaaggaa gtggtgcgtgtctggttctgtaatcgaagacaaaaagagaaaagaatgactccgccaggg gatcagcagccgcatgaggtttattcgcacaccgtgaaaacagacacatcttgccatgat ctctga >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_7|338_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLERQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWRPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_7|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagaaaggcaggccaacgttcagattcaggaaata cagagaacaccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtggaggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_8|976_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHHKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLLRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLFKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP FTIASKRIKYLGIQLTRDVKELFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY YKATVTKTAWDWYQNRDIDQWNRTESSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE NWLTICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKT PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK QIYKKKTTPSKSGRRT >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_8|2931_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccacaaatcaacagaatatacattcttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctataagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agattaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ctaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccattgtctcagcccaaaatctc tttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gaactcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagccggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatgggactggtaccaaaacagagatatagatcaa tggaacagaacagagtcctcagaaataatgccgcatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctaaccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactcaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctataatgaactcaaa caaatttacaagaaaaaaacaaccccatcaaaaagtgggcgaaggacatga >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_9|44_aa MKTLLYFNKETGYNTQCQGVTSPDKGTSTKRDEKELTQELWQLK >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_9|135_bp atgaaaaccctgctgtatttcaacaaagaaaccggatacaatacacagtgtcaaggagta acaagtcctgataaaggaacatccacaaagagagatgagaaagaactaacacaagaactc tggcaactcaaatga >gi568815575f:83408325_83609407|GENSCAN_predicted_peptide_10|62_aa MTVMKKEIFILTDFSEQETYQATLRESISVVKRQRKQEVNVAKSPSCDSYVKNQARQGLS SH >gi568815575f:83408325_83609407|GENSCAN_predicted_CDS_10|189_bp atgactgtcatgaagaaagaaatttttatactgacagatttctcagaacaagagacatac caagccacactgagggaaagcatcagtgtagtcaagaggcagaggaagcaagaggtaaat gtggccaagagtccttcttgtgattcctatgtgaagaaccaggcaaggcagggcctttca agccattag