GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:28:05 Sequence gi568815583r:68680084_68920751 : 240668 bp : 44.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3648 3719 72 0 0 118 -2 79 0.079 0.42 1.02 Intr + 6927 7027 101 0 2 71 89 34 0.164 1.55 1.03 Intr + 14295 14497 203 1 2 46 49 87 0.569 -0.40 1.04 Intr + 15057 15173 117 2 0 133 97 204 0.959 26.46 1.05 Intr + 16544 16568 25 1 1 87 75 8 0.239 -3.00 1.06 Intr + 20026 20126 101 2 2 67 78 77 0.393 4.43 1.07 Intr + 30649 30798 150 0 0 120 91 319 0.999 35.76 1.08 Intr + 31459 31623 165 0 0 128 44 284 0.997 28.16 1.09 Intr + 33842 33958 117 1 0 133 76 97 0.906 13.66 1.10 Intr + 34476 34580 105 2 0 50 62 132 0.976 7.21 1.11 Intr + 35132 35228 97 1 1 90 97 218 0.999 22.48 1.12 Intr + 38615 38727 113 0 2 60 110 259 0.533 25.40 1.13 Intr + 39061 39151 91 0 1 82 66 80 0.997 4.77 1.14 Intr + 39330 39469 140 1 2 60 94 134 0.652 11.38 1.15 Term + 45760 45891 132 0 0 117 36 356 0.975 31.39 1.16 PlyA + 46069 46074 6 1.05 2.03 PlyA - 47272 47267 6 1.05 2.02 Term - 75032 74897 136 1 1 136 36 112 0.726 8.29 2.01 Init - 88375 88314 62 1 2 78 73 34 0.067 1.62 2.00 Prom - 93458 93419 40 0.14 3.12 PlyA - 97680 97675 6 1.05 3.11 Term - 100059 99998 62 1 2 112 42 114 0.998 7.17 3.10 Intr - 100390 100300 91 1 1 106 29 157 0.581 11.17 3.09 Intr - 102970 102873 98 2 2 38 94 293 0.978 24.63 3.08 Intr - 104512 104314 199 1 1 91 64 428 0.999 39.62 3.07 Intr - 107452 107330 123 1 0 73 119 35 0.983 5.88 3.06 Intr - 107836 107687 150 1 0 105 94 138 0.927 16.46 3.05 Intr - 113559 113330 230 1 2 63 71 83 0.161 1.69 3.04 Intr - 116528 116471 58 2 1 76 97 11 0.063 -0.74 3.03 Intr - 125847 125779 69 0 0 95 60 44 0.073 1.68 3.02 Intr - 139630 139480 151 0 1 72 52 58 0.016 0.76 3.01 Init - 140976 140816 161 0 2 46 92 189 0.350 12.40 3.00 Prom - 143113 143074 40 -3.16 4.00 Prom + 146752 146791 40 -6.96 4.01 Init + 147367 147382 16 0 1 99 71 24 0.476 0.84 4.02 Term + 154095 154666 572 1 2 36 49 415 0.651 27.00 4.03 PlyA + 155279 155284 6 1.05 5.00 Prom + 159213 159252 40 -4.26 5.01 Init + 170838 170848 11 2 2 114 67 -6 0.043 -0.26 5.02 Intr + 172563 172691 129 0 0 59 94 64 0.039 3.91 5.03 Intr + 197072 197094 23 2 2 75 80 40 0.003 -0.91 5.04 Intr + 216560 216736 177 0 0 79 82 67 0.105 5.09 5.05 Term + 223688 223872 185 0 2 35 46 139 0.210 2.11 5.06 PlyA + 225333 225338 6 1.05 6.00 Prom + 234140 234179 40 -5.96 6.01 Init + 236408 236902 495 1 0 94 41 211 0.772 12.11 6.02 Term + 237609 238592 984 2 0 49 34 295 0.830 12.27 6.03 PlyA + 240198 240203 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:68680084_68920751|GENSCAN_predicted_peptide_1|576_aa XACSTSRQESVALVKTTVIMVLTACCSGIAISHLSHVDSIWIPVFCAGTLEAWHSVVAER GNRGSEKFINFPKATTFTAKNQQSCNSNTGLWDSEDYAFNHRALCTGVQDVHCRRTLIHI VERMARTGRIEPNYPKVCGHQGNVLDIKWNPFIDNIIASCSEDTSTLPTFTSGAPEASAT EALPPPIPAGMVNLLLTRHDIQTQAWEVRIWEIPEGGLKRNMTEALLELHGHSRRVGLVE WHPTTNNILFSAGYDYKVLIWNLDVGEPVKMIDCHTDVILCMSFNTDGSLLTTTCKDKKL RVIEPRSGRVLQEANCKNHRVNRVVFLGNMKRLLTTGVSRWNTRQIALWDQEDLSMPLIE EEIDGLSGLLFPFYDADTHMLYLAGKGDGNIRYYEISTEKPYLSYLMEFRSPAPQKGLGV MPKHGLDVSACEVFRFYKLVTLKGLIEPISMIVPRRSDSYQEDIYPMTPGTEPALTPDEW LGGINRDPVLMSLKEGYKKSSKMVFKAPIKEKKSVVVNGIDLLENVPPRTENELLRMFFR QQDEIRRLKEELAQKDIRIRQLQLELKNLRNSPKNC >gi568815583r:68680084_68920751|GENSCAN_predicted_CDS_1|1731_bp nnagcatgttctacctctcggcaagagagtgtggccttggtcaagaccactgtcatcatg gtgctcacagcctgttgttctggcattgccatctcacatctttctcatgtggactcaata tggattcctgtgttctgtgctggtacactggaagcatggcactctgtggttgcagagagg ggaaaccgaggctcagagaagttcattaactttcctaaggccacaactttcactgctaaa aatcagcagagctgtaattcaaacacaggcctgtgggactccgaagactatgctttcaac cacagggcactgtgtaccggtgttcaagatgtacactgcagaaggacgctcattcatatc gtagagagaatggctcgtacaggcaggattgaacccaactaccccaaggtctgcggccac cagggcaatgtgctggatatcaaatggaaccccttcatcgacaacatcattgcctcgtgc tcggaggacacgtcgactctaccaactttcaccagtggagcacctgaggcatcggccacc gaggccctccctccacccattccagctggaatggtcaacttgctgctcaccagacacgac atccagacccaggcttgggaggtgcggatctgggagatccccgagggcgggctgaagcgg aacatgacggaggcgctcctggagctgcacgggcacagccggcgtgtggggctggtcgag tggcaccccaccaccaacaacatcctgttcagcgctggctacgactacaaggtcctcatc tggaacctggatgtgggtgagccggtgaagatgattgactgccacacggatgtgatcctc tgcatgtccttcaacacggacggcagcctgctcaccaccacgtgcaaggacaagaagctg cgtgtgattgagccccgctctggccgtgttctgcaggaggccaactgcaaaaaccacaga gtgaaccgggtggtgttcctggggaacatgaagcggctcctcacgacaggggtctccagg tggaacacaagacagattgccctctgggaccaggaggacctctccatgcccctgatcgaa gaggaaattgatgggctctctggcctcctgttccccttctatgatgctgacacccacatg ctctacctggctggaaagggtgatggaaacatccggtactacgagatcagcactgagaag ccctacctgagttacctcatggagttccgctccccagccccgcagaaaggcctaggggtc atgcccaagcacgggctggatgtgtcagcctgcgaggtgttccgcttctacaagctggtg actctcaagggcctgatcgagcccatctccatgatcgtgccccggaggtcagattcctac caggaagacatttacccaatgacaccaggcacggagccagcactgaccccggatgaatgg ctgggaggcatcaaccgagatcccgtgctgatgtctttgaaagaaggctataagaagtcc tcaaaaatggtatttaaggctcccatcaaagaaaagaagagtgttgtggtcaacggaata gatttattagaaaatgtcccacccaggacagagaatgagctccttcgaatgttcttccgg cagcaggatgagattcgacggttgaaagaggagctggcccagaaggacatccgcattcgg cagctccagctggaactgaaaaacttgcgcaacagccccaagaactgttag >gi568815583r:68680084_68920751|GENSCAN_predicted_peptide_2|65_aa MSYHSSTKAVKIKETDNTKSCPGILLQSSATPLLRKLPVQVLIKDVCLPCYKQERKHYFL DCVFT >gi568815583r:68680084_68920751|GENSCAN_predicted_CDS_2|198_bp atgagttatcactcatccactaaagcagtgaaaattaaggagactgacaacaccaaaagt tgccctggcatactgctgcaaagctccgcaacgcctttgctcaggaagctccccgtgcag gttctcatcaaggatgtctgcctcccctgctacaaacaggaacgaaaacactacttcctg gattgcgtctttacataa >gi568815583r:68680084_68920751|GENSCAN_predicted_peptide_3|463_aa MAEPPWPAAQPAHWSLPPPAATAIGCCPETLGGDWLGPLPLVHGAADPASTAIRCQPLGP GAPQLEPPARLSQCWACRQPFGACLINGHGKLCQDASCPELAPVELDRHITYAALVGLLF RWGRQSVCLPLQSSSTYLFSLPASTEEAFEECTLARPQSCYLAHWTGVVYWWLSPPSRPP HLLLARTPTLAYCFFRVSSANSVAGVPHVERAGDLHAEHWVIQVKELVLDNSRSNEGKLE GLTDEFEELEFLSTINVGLTSIANLPKLNKLKKLELSDNRVSGGLEVLAEKCPNLTHLNL SGNKIKDLSTIEPLKKLENLKSLDLFNCEVTNLNDYRENVFKLLPQLTYLDGYDRDDKEA PDSDAEGYVEGLDDEEEDEDEEEYDEDAQVVEDEEDEDEEEEGEEEDVSGEEEEDEEGYN DGEVDDEEDEEELGGMAAFLLSAEEERGQKRKREPEDEGEDDD >gi568815583r:68680084_68920751|GENSCAN_predicted_CDS_3|1392_bp atggccgagcctccctggccggctgcgcagcccgcccattggtcccttcccccccccgcc gccactgccattggctgttgcccggagaccctcggcggcgattggctcgggccgctgccg ctcgtccatggggccgcagatcccgcctccacggcgatcagatgccagcctttgggccct ggtgccccccagctcgaacccccggcccgtctttctcagtgctgggcttgtcggcagcca ttcggcgcctgcttaataaatgggcacgggaaactttgtcaggacgcgagctgcccggag ctggcgcccgtggaactagacagacacatcacctatgctgctcttgtggggcttctgttc cgctgggggagacagtcagtatgcctacctcttcagtcctccagcacctacctcttcagt cttccagcatccactgaagaggcatttgaagaatgcaccctggcccggccccagtcctgc tacctggcacactggactggagttgtttactggtggctgtctcctcccagccggccacca cacctcttgctggcaaggaccccgaccttggcttactgtttcttccgtgtctcctcagcc aacagtgtcgcaggtgtgccccatgtggagagggctggagacctgcatgcagagcactgg gtcattcaggtgaaagaacttgtcctggacaacagtcggtcgaatgaaggcaaactcgaa ggcctcacagatgaatttgaagaactggaattcttaagtacaatcaacgtaggcctcacc tcaatcgcaaacttaccaaagttaaacaaacttaagaagcttgaactaagcgataacaga gtctcagggggcctggaagtattggcagaaaagtgtccgaacctcacgcatctaaattta agtggcaacaaaattaaagacctcagcacaatagagccactgaaaaagttagaaaacctc aagagcttagaccttttcaattgcgaggtaaccaacctgaacgactaccgagaaaatgtg ttcaagctcctcccgcaactcacatatctcgacggctatgaccgggacgacaaggaggcc cctgactcggatgctgagggctacgtggagggcctggatgatgaggaggaggatgaggat gaggaggagtatgatgaagatgctcaggtagtggaagacgaggaggacgaggatgaggag gaggaaggtgaagaggaggacgtgagtggagaggaggaggaggatgaagaaggttataac gatggagaggtagatgacgaggaagatgaagaagagcttggtggtatggcagccttttta ctctcagctgaagaagaaaggggtcagaagcgaaaacgagaacctgaagatgagggagaa gatgatgactaa >gi568815583r:68680084_68920751|GENSCAN_predicted_peptide_4|195_aa MVPALVKELPWEPLSPEEVQSVQEHLGHESDSLLFVQITSKKKKTKNNFEVSSSSQLKLS ITKKSSPSVKPAGYPAAAKLWTLSANDMEDDSMDLIDSEELLDPEDLKKPDPTSLQAAPC GEGKKRKTCQSCTRGLAEELEKEKSREQMSSQPKSACGNCYLGNSFHCASCPYLGIPAFK PGEKVLLSNSSLYHA >gi568815583r:68680084_68920751|GENSCAN_predicted_CDS_4|588_bp atggtccctgccctagtgaaagagctgccatgggagcccttaagccctgaggaggtacag tctgttcaagaacacctgggtcatgaaagtgacagcctgctctttgttcagatcacaagc aaaaaaaaaaaaacaaaaaacaactttgaagtgagttcttctagtcagcttaagctttcc atcaccaagaagtcatctccttcagtgaagcctgctgggtaccctgctgctgccaagctg tggaccctctcagccaatgatatggaggatgacagcatggatctcattgactcagaggag ctgctggatccagaagatttgaagaagccagatccgacttccctgcaggctgctccttgt ggggaagggaaaaagaggaagacctgccagagctgcacccgtggccttgccgaagaactg gaaaaagagaagtcaagggagcagatgagctcccaacccaagtcagcttgtggaaactgc tatctgggcaatagtttccactgtgccagctgcccctaccttgggataccagccttcaaa cctggggaaaaggtgcttctgagcaatagcagtctttaccatgcctag >gi568815583r:68680084_68920751|GENSCAN_predicted_peptide_5|174_aa MPCLVRRVSMRGRFGVSELEQHEDSCPAQGGKVSMQQGEGESQTRARQVAHEVGGPIVSW ASNSQRPLNLEFGPYIHPKEVLSVITEINQQGQKKEMCNLQLQRWIPLVEGTAVFCKEVF GDYCNILFPIKLPPSFSTDRQFSLQLGIPMMVANAGFLFPFYVYSMAFYCKEEL >gi568815583r:68680084_68920751|GENSCAN_predicted_CDS_5|525_bp atgccctgccttgtgaggagagtgtccatgcgggggcggtttggcgtgtcagagctggaa caacatgaagacagctgcccagcacagggtgggaaggtgtccatgcagcaaggggagggt gagagccagaccagagccaggcaagtggcccacgaagttggagggcctattgtgtcctgg gcttcaaattcacagaggcccttaaatttagaatttggaccttacatccatcccaaagag gttctatctgtgatcacagagataaaccaacaaggacagaagaaagagatgtgcaactta cagcttcagcgttggatcccattagtggaaggcactgctgtattttgtaaagaggtgttc ggagattattgcaatatcttgttccccatcaaacttccacccagctttagcactgatcga caattctcacttcaactgggcatccccatgatggttgccaatgctggttttctatttcca ttctacgtgtattcgatggcattctactgtaaagaagagctttga >gi568815583r:68680084_68920751|GENSCAN_predicted_peptide_6|492_aa MPLKNCAISLVIWKMQIKISMKLHTRKWRHKKPFKKINESRSWFFEKINKIDRPLARLIK KREKNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQ EEAESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFHQRYKEELISNFSKVSGYKINVQ KSQTFLYTNNRQTESQIMSELPFSIASKRIKYLGIHLTRDVKDLFKENYKPLLNEIKEDT NKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARI AKSILSQKNKAGGITLLDFKLYYKATVTKRACYWYQNRDIDQWNKTEPSEIMPHIYNYLI FDKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKT LEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEW EKIFAPTHLTKG >gi568815583r:68680084_68920751|GENSCAN_predicted_CDS_6|1479_bp atgcccctgaaaaactgtgcaatatcattagtcatttggaaaatgcaaatcaaaatctca atgaaacttcacacccggaaatggagacacaaaaagcccttcaaaaaaatcaatgaatcc aggagctggttttttgaaaagatcaacaaaattgatagaccactagcaagactaataaag aaaagagagaagaatcaaatagacgcaataaaaaatgacaaaggggatatcaccaccaat cccacagaaatacaaactaccatcagagaatactataaacacctctacgcaaataaacta gaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccag gaagaagctgaatctctgaatagaccaataacaggctctgaaattgaggcaataattaat agcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattccaccagagg tacaaggaggagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacaaacattcttatacaccaataacagacaaacagagagccaaatcatgagtgaa ctcccattctcaattgcttcaaagagaataaaatacctaggaatccaccttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggacaca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatggcc atactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaaattcatatggaaccaaaaaagagcccgcatt gccaagtcaatcctaagccaaaagaacaaagcaggaggcatcacgctacttgacttcaaa ctatactacaaggctacagtaaccaaaagagcatgctactggtaccaaaacagagatata gaccaatggaacaaaacagagccctcagaaataatgccacatatctacaactatctgatc tttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccataaaaacc ctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtct aaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaacta aagagcttctgcacagcaaaagaaaccaccatcagagtgaacaggcaacctacagaatgg gagaaaatttttgcacctactcatctgacaaagggctaa