GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:21:23 Sequence gi568815594r:169891381_170105877 : 214497 bp : 41.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 21644 21688 45 1 0 89 107 20 0.668 4.82 1.02 Intr + 36697 36887 191 0 2 48 103 91 0.258 4.06 1.03 Intr + 39992 40065 74 2 2 90 31 50 0.234 -2.37 1.04 Intr + 40723 40886 164 2 2 59 81 142 0.319 9.37 1.05 Term + 48176 48205 30 1 0 65 42 49 0.005 -5.12 1.06 PlyA + 48570 48575 6 1.05 2.09 PlyA - 48590 48585 6 1.05 2.08 Term - 49510 49084 427 0 1 76 40 217 0.400 9.49 2.07 Intr - 51684 51633 52 1 1 39 110 54 0.497 -0.25 2.06 Intr - 54063 53971 93 1 0 94 56 103 0.011 6.72 2.05 Intr - 63023 62872 152 1 2 64 68 78 0.005 2.29 2.04 Intr - 67538 67468 71 2 2 92 45 57 0.004 -1.14 2.03 Intr - 71593 71477 117 1 0 50 60 77 0.089 0.84 2.02 Intr - 71847 71730 118 2 1 58 52 114 0.199 4.35 2.01 Init - 82804 82740 65 1 2 63 92 69 0.253 5.47 2.00 Prom - 90968 90929 40 -5.55 3.00 Prom + 92656 92695 40 -7.45 3.01 Init + 95996 96073 78 1 0 22 94 93 0.925 4.41 3.02 Term + 97005 97139 135 2 0 39 47 174 0.948 5.44 3.03 PlyA + 98425 98430 6 1.05 4.03 PlyA - 99109 99104 6 1.05 4.02 Term - 100929 99998 932 1 2 105 37 1226 0.992 110.51 4.01 Init - 104529 104310 220 0 1 99 34 148 0.896 9.34 4.00 Prom - 112283 112244 40 -3.05 5.07 PlyA - 113631 113626 6 1.05 5.06 Term - 114511 114189 323 2 2 84 39 180 0.132 6.70 5.05 Intr - 116095 116001 95 0 2 40 48 99 0.325 -0.21 5.04 Intr - 119110 119034 77 1 2 54 103 62 0.412 1.69 5.03 Intr - 120372 120221 152 1 2 27 95 63 0.363 -0.14 5.02 Intr - 123696 123530 167 2 2 72 78 98 0.381 5.88 5.01 Init - 124582 124485 98 1 2 54 53 103 0.441 3.23 5.00 Prom - 128628 128589 40 -6.05 6.00 Prom + 129741 129780 40 -8.65 6.01 Init + 134476 134948 473 0 2 65 31 520 0.060 39.04 6.02 Intr + 135928 136202 275 1 2 79 31 154 0.008 5.06 6.03 Intr + 137756 137906 151 0 1 17 33 169 0.005 2.60 6.04 Intr + 140470 140635 166 1 1 72 68 74 0.040 2.84 6.05 Term + 149258 149491 234 1 0 68 49 192 0.508 8.64 6.06 PlyA + 149507 149512 6 1.05 7.00 Prom + 153645 153684 40 -3.65 7.01 Init + 163797 163942 146 2 2 66 93 104 0.839 8.34 7.02 Term + 164034 164184 151 0 1 82 50 62 0.817 -1.80 7.03 PlyA + 164716 164721 6 1.05 8.12 PlyA - 167194 167189 6 1.05 8.11 Term - 168171 168064 108 0 0 38 32 113 0.083 -1.77 8.10 Intr - 170613 170512 102 0 0 36 89 74 0.433 1.75 8.09 Intr - 173445 173339 107 1 2 112 50 121 0.674 9.71 8.08 Intr - 175098 175034 65 2 2 93 105 71 0.991 6.74 8.07 Intr - 178263 178175 89 0 2 78 95 30 0.655 0.55 8.06 Intr - 181965 181756 210 0 0 109 57 170 0.722 14.19 8.05 Intr - 187203 187129 75 0 0 78 65 87 0.916 4.19 8.04 Intr - 195868 195736 133 0 1 63 116 7 0.660 0.73 8.03 Intr - 197184 197016 169 1 1 73 106 97 0.823 8.08 8.02 Intr - 198356 198232 125 1 2 86 12 192 0.581 10.71 8.01 Init - 198691 198405 287 1 2 66 42 157 0.926 3.50 8.00 Prom - 200893 200854 40 -8.85 9.00 Prom + 201358 201397 40 -9.25 9.01 Init + 202755 202854 100 2 1 75 28 65 0.542 -0.33 9.02 Intr + 206419 206594 176 2 2 64 55 158 0.774 8.84 9.03 Intr + 208438 208726 289 0 1 122 86 138 0.574 12.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_1|167_aa MVKLQRKLCFCKALEVFIYQLQKGFIQSIQETINQVNSIGVGNGLAFTNYLDQRLPVQLF EMMEMLYLCIANAVATSFIHPVTYSSSISFGMFMMDPRNLGIKGWAPNWDQAFKHCASQD IKRWGYRMFMVHTACHNYWKKQHEDLGLIFEGQSIINMPAADWNPYW >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_1|504_bp atggtcaaactacagcgcaagctttgtttttgcaaagctctggaggtattcatatatcaa ttgcaaaagggcttcattcaaagcatacaagaaacaattaaccaggttaattcaatagga gttggaaatggattggcttttacaaactatctcgaccagcgccttccagtacaacttttt gagatgatggaaatgctctatctctgtattgccaatgcagtagccactagcttcatccat cctgtgacctacagctcctccataagctttggaatgttcatgatggatccaagaaatctt ggtatcaagggctgggcccccaactgggaccaggcttttaagcattgtgcatcacaggac ataaaacgctggggctacagaatgtttatggttcatactgcctgtcacaattactggaaa aagcaacacgaagaccttggacttatatttgaaggacagtccattattaatatgcctgct gctgactggaatccttactggtaa >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_2|364_aa MASSGTLQQMVSGTREREEPARMYSCKGVIQGPGGLDFPPAPKAHHSQPEDQTCGQQGEE ELFSKSPVNSESDNLLSKIKVMSLMVEFQADGDGQLDSGQLSEGDLAQWRGAQAEPDNLH LNPGALEHGEMMVSGFPVYAGFNDIGSGKRHSTLYPTTGSASSSTVFLQLQTSAGLQAHN NNSLLDTIPTTFTPKPDDDSQSEVQEESQDREGGNGEGEVTVRMERKATALRAAGPAWEQ SWPEKNTSLLKRPQRFDRNAAKLRAISRTPRILWGRISRGKPPGAVVAKSQPERSFSRLP HGHWPKALGPAQSSSPAWTPGRSRGQSEGLDESPFPLRPPCRTPRSLHPERKKKEIEHFP KMTV >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_2|1095_bp atggcatcatcagggaccctgcaacagatggtttccggcacaagagagagggaggagcca gccagaatgtattcttgcaagggagtcatccaaggcccaggaggacttgactttcctcct gcccctaaggcacatcactcacagcctgaggaccagacctgcggacagcagggagaggag gagctgttcagcaaatctccagttaactctgaatctgataacttattaagcaagatcaaa gtaatgtctctgatggtggaatttcaggcagatggtgatgggcaattggactcaggacag ttgtctgagggtgacctggcacagtggcgaggtgcgcaggctgagcctgacaacctgcat ttgaatcctggggctctggagcatggagagatgatggtttctgggttccccgtctatgca ggatttaatgatattgggtctgggaaaaggcactcaacgttatatcccaccacagggtca gcatcctcaagcactgtgttcttacagctccaaacatctgcagggctccaggcccacaat aataactctctgctggacactatccccaccaccttcacccctaaaccagatgatgattct caaagtgaggtccaggaggaaagtcaggacagagaaggaggaaatggggaaggagaagtg actgtcagaatggagaggaaggccactgcactgagagcagcgggcccagcctgggagcag tcatggccggagaaaaacacgtccctcctaaagaggcctcagagatttgacagaaatgca gctaaactcagagcaatttctagaactcctaggatcctgtggggtcgcattagcagggga aaaccaccaggggctgttgtagccaagagccaaccagaaagaagtttctcacgtttacct catggacactggccaaaggccctggggccagcccagagcagcagccccgcatggacacca ggaagatccagaggacagtcagaaggtctagacgagagcccttttcccctgaggccacca tgtcgcactcccaggtccctacatccagagaggaagaaaaaggagattgaacattttcct aaaatgactgtttaa >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_3|70_aa MPKGYVKSGKGKYLNNAEKTEYLKCKNFKKNIARDMTRMTQKPEIDFTNDYIRPKGRGFG FSVKLLKLPK >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_3|213_bp atgccaaaagggtacgtgaagagcggtaaaggaaagtatctgaacaatgctgagaaaaca gagtatctgaaatgtaagaatttcaagaaaaacattgctagagatatgaccaggatgact cagaagccagaaattgattttacaaacgactacatacgacccaaaggacgaggctttgga tttagtgtgaagcttttaaaactgcccaaatga >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_4|383_aa MAAYVVSSKIIACRTDLASATDILGFICAEAMQLHEHSVHKLDIPTQLQFKIHGDCTVYP HFACKAPKICDSHRKWQMHDSGLLNITKVSFSDRGKYTCVASNIYGTVNNTVTLRVIFTS GDMGVYYMVVCLVAFTIVMVLNITRLCMMSSHLKKTEKAINEFFRTEGAEKLQKAFEIAK RIPIITSAKTLELAKVTQFKTMEFARYIEELARSVPLPPLIMNCRTIMEEIMEVVGLEEQ GQNFVRHTPEGQEAADRDEVYTIPNSLKRSDSPAADSDASSLHEQPQQIAIKVSVHPQSK KEHADDQEGGQFEVKDVEETELSAEHSPETAEPSTDVTSTELTSEEPTPVEVPDKVLPPA YLEATEPAVTHDKNTCIIYESHV >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_4|1152_bp atggctgcctatgttgtttcttccaaaatcattgcatgtagaactgacctggcatctgct actgacattttgggttttatttgtgctgaagcaatgcagcttcatgaacattcagtgcat aagcttgacatacccactcagttacagtttaaaatacatggggactgcactgtgtatccc cacttcgcgtgtaaagccccgaagatctgtgatagtcaccgaaaatggcaaatgcacgac agcggcctcctgaacatcaccaaggtatccttctcagaccgaggtaaatacacgtgtgtg gcttctaacatctacggcaccgtgaacaacacggtgaccttgcgcgtcatcttcacttct ggagacatgggtgtctactacatggtcgtgtgcctggtggccttcaccatcgtcatggtc ctcaatatcacccgcctgtgcatgatgagcagccatctaaagaagactgagaaggccatc aatgagttctttaggaccgaaggtgcagagaagctgcagaaggcatttgagatcgccaag cgcatccccatcatcacctccgccaaaactctagagcttgccaaagtcacccagttcaaa accatggagttcgcccgctacatcgaagagcttgccaggagcgtgcctctgccgcctctc attatgaactgcaggactatcatggaggagattatggaggtggttgggctggaggagcag gggcagaattttgtgaggcatactccagagggccaggaggccgcagacagggatgaggtc tacacaatccccaactctctgaagcggagcgactcccctgccgctgactcggacgcctca tcgctgcacgagcaacctcagcaaattgccatcaaggtgtcagttcacccgcagtccaaa aaagagcatgcagatgaccaagagggtggacagtttgaagtcaaagatgtagaggagaca gaactgtcggcggaacattcccccgaaactgcagaaccttctaccgatgtcacgtccacc gagctaacatctgaagagccaacacctgttgaggtaccagataaggtactgccgccagct tacctggaagccacagagccagcagtgacacatgacaaaaacacctgcattatttacgaa agccatgtctaa >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_5|303_aa MKVSRPKDTGLGQTPCGTARIYTTSSSWSEEERHLILSICCIKISAPRLQETCSEFFTLA VAKNTRCVSVRSYSVSQLGTINTILQAHPVTLSCNCRYMFLCPLPDSDKRGESCVPFRIL TRGGRPQPLAVHPAQVISQVVDVEEGEGEAGTLSVTSVKKEKTCIHTASEGGYVQNPGLV SLKPEAMLFTAILYRIEQAKKMDRLKSHLTVCFLPSVPFLILVSTLATAKSVTNSTLNGT NVVLGSVPVIIARTDHIIVKEGNSALINCSVYGIPDPQFKWYNSIGKLLKEEEDEKERGG GRL >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_5|912_bp atgaaggtttcaaggcccaaagacacgggccttggtcagaccccctgtggaacagctcgg atttacacaaccagcagttcttggtctgaagaggaaaggcacctgatcctatccatttgc tgtattaaaatctcagctccacggctgcaggagacctgcagtgaattcttcacactggct gttgctaagaacactcgatgtgtttctgttagaagttattctgtgtctcagttgggtact attaatactatccttcaggctcatcctgtcacactgtcctgtaactgtcgatatatgttc ctgtgtcccctgccggattctgacaagaggggagagtcctgtgtccccttccggattctg acaagagggggaaggcctcaacccctggcagtgcacccagcgcaggtcatttctcaggtg gtggatgtggaagagggggaaggagaggcaggcacactcagtgtaacttctgttaaaaaa gaaaaaacctgcatacacacagccagcgaaggtggttatgtccagaatcctggcctggtc tctctcaaacctgaggccatgctcttcacagctattctgtatcgaatagagcaagcaaag aagatggatcgattgaagagccatctgactgtgtgctttctaccttctgtgcccttttta atcctagtatccactctagccaccgctaagagtgtgactaacagcactttaaatggcact aacgtggtcttgggctctgtgcccgtaatcattgccagaactgaccatatcatagtcaag gaagggaacagtgccttgattaactgtagtgtttatggcatccctgacccacagttcaag tggtataattccattggcaagctgctgaaagaagaagaggatgagaaggagagaggagga ggtaggctttaa >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_6|432_aa MREIQSPAKAPTAVRRPLPPRHLGGSPAAGSGFRAPRPRKVTVRAQDAAVPKRPAPAPHG SRSILRPPGARCPDPAASRSAASAAVSASPNTRKFGGPGADSAADPVRGWLPPVPLRPGP PGAPANLTPPRHSGGRRAPLAMASPTAAAAPHSAPDARARPLGHTPLRPFYSRTALFPVG CCENPERWTFHIDAPGGSGSQTGHVPPPGQALLCQERSFSNSQSLCSSLFLANWQSSASD PTPRGRTCRTAIIREDDSSQEEETTEHLQWGGAKDENLEEEDEKKQDDLQLLFCLYLLAW KEGKPDIHFPTIQWRMLELGDPGKQISKETNISSQHMGKQTRKEANIFSKNVAHQLSQGR SEDGESLEDQLQRRAILSVESWEDDGTTSCREELQYLLRTEHTLGGGSSLLRAAELMGQP SCRKEPPTVALL >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_6|1299_bp atgagagagatccagagcccagcgaaagcccccacggctgttcggcgtccccttccccca cgacacctaggaggctcgccggcagctggcagtggattccgcgctcctcgccctcggaag gttacggttcgagcccaggatgcggcagtccccaaacggccagcccccgccccgcacgga tctcgaagcatcctgcgaccccctggagcccgctgcccggaccctgcggcgtctcgcagc gcagcctccgccgcagtctcagccagcccaaacacccgaaagttcggcggccccggcgcc gactcggccgccgacccggtgcggggctggctcccacccgtgcccctccgccccggcccg ccgggggcccccgctaacctgacaccgccgcgccactcaggtggccgccgtgcacccctc gccatggccagcccgacagcggccgctgcgccgcactctgcgccggacgcccgtgcccgc cctcttggccacaccccactccggcctttttactcccgtaccgccctcttcccggtgggt tgctgcgaaaacccggaaaggtggactttccacattgacgctcccggcggctccggcagc cagactgggcatgttccgccgccgggtcaggcattgctgtgccaagagaggagctttagc aactctcagtctctctgctcctctctgttcctggcaaactggcaaagtagtgcctctgac cccacgccccgaggtcggacctgcagaacggcaatcatcagagaggatgacagctctcag gaagaagagaccacagaacacctacaatggggcggtgcaaaggatgagaatctggaggag gaggatgagaagaaacaggatgacctgcaattgctcttctgcttgtatttactagcttgg aaggaaggaaagccagatatccactttcccactatacaatggaggatgctggagttgggt gacccagggaaacaaataagcaaggaaacaaacatcagcagtcaacatatgggaaaacaa acacgcaaagaagccaacatattctccaagaatgtggcccaccagctcagccagggcaga tcagaggatggagagtcactagaggaccagctgcagagaagagctatcctatctgtggag agctgggaagatgatgggaccaccagctgcagagaggagctacagtatctgctgagaact gagcacacgttgggaggaggctcctctctgctgagagctgcagagttgatgggacaacct tcctgcagaaaggagccacccactgtggctctcctctga >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_7|98_aa MPHEDWSDAATSQGNTKPRRWPGTDPSLRLQREHGPAGTLILDFKIPERILDCYGSSNRP DVVPPQGFCIASSGCLVSLHKYLHGPLPYSSLLKCRLF >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_7|297_bp atgcctcatgaagactggagtgatgctgccacaagccaaggaaataccaagcctaggaga tggcctggaacggatccttccctacgccttcagagggagcatggccctgcaggcaccttg atcctagacttcaagattccagaacgcattctggactgttatggttcctcaaacaggcca gatgtagttcctcctcagggcttctgcattgcaagttcaggctgcctggtatctttgcac aagtatttgcatggcccactcccttactcctctctgctcaaatgccgcttgttctag >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_8|489_aa MIRDGEARAAGGHRPRRGAGGGAARPAAGQNRPGPWRPQERVRRGLCRARHTAPDPFRTG RPNAASSRGVSAAFSGHRAREARRSGGTHLAVALPDPQPLKKKLDAWLSEDMNYARFITA ASAARNPSPIRTMSEKRADILSRGPKSMISLAGGLPNPNMFPFKTAVITVENGKTIQFGE EMMKRALQYSPSAGIPELLSWLKQLQIKLHNPPTIHYPPSQGQMDLCVTSGSQQGLCKVF EMIINPGDNVLLDEPAYSGTLQSLHPLGCNIINVASDESGIVPDSLRDILSRWKPEDAKN PQKNTPKFLYTVPNGNNPTGNSLTSERKKEIYEDQRSLSEHPSVLLAILIMVLFKLVFLE SLLVIDFYSNQKDAILAAADKWLTGLAEWHVPAAGMFLWIKVKGINDVKELIEEKAVKMG VLMLPGNAFYVDSSAPSPYLRASFSSASPEQMDVATSGNYVGLIEETQGLEEDQDEEPDN SLHCGNRKL >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_8|1470_bp atgatccgggacggggaggcgcgcgccgcgggcggccataggccacggcgcggggcggga gggggcgcggcgaggcccgcggcggggcaaaaccggcctgggccctggcggccgcaggag cgcgtgcggcgtggactttgccgggctcgccacacagccccagacccgtttaggaccggg agaccgaacgcagcgtccagccggggagtttcggcggcgttctccgggcaccgcgcgcgg gaagccagacgcagcggggggacacatctcgcggtggcgttgccagatcctcaaccactg aagaagaagcttgatgcttggctgtcagaagacatgaattacgcacggttcatcacggca gcgagcgcagccagaaacccttctcccatccggaccatgagtgagaaacgggctgacata ttgagcagaggaccaaaatcgatgatctccttggctggtggcttaccaaatccaaacatg tttccttttaagactgccgtaatcactgtagaaaatggaaagaccatccaatttggagaa gagatgatgaagagagcacttcagtattctccgagtgctggaattccagagcttttgtcc tggctaaaacagttacaaataaaattgcataatcctcctaccatccattacccacccagt caaggacaaatggatctatgtgtcacatctggcagccaacaaggtctttgtaaggtgttt gaaatgatcattaatcctggagataatgtcctcctagatgaacctgcttattcaggaact cttcaaagtctgcacccactgggctgcaacattattaatgttgccagtgatgaaagtggg attgttccagattccctaagagacatactttccagatggaaaccagaagatgcaaagaat ccccagaaaaacacccccaaatttctttatactgttccaaatggcaacaaccctactgga aactcattaaccagtgaacgcaaaaaggaaatctatgaggatcagcgttctctctctgaa cacccttcagtgctgctggccatcctcattatggtcttgttcaaactagtgtttcttgag tccctcctggttattgatttctatagtaaccagaaggatgcaatactggcagctgcagac aagtggttaactggtttggcagaatggcatgttcctgctgctggaatgtttttatggatt aaagttaaaggcattaatgatgtaaaagaactgattgaagaaaaggccgttaagatgggg gtattaatgctccctggaaatgctttctacgtcgatagctcagctcctagcccttacttg agagcatccttctcttcagcttctccagaacagatggatgtggcaaccagtggtaattat gtaggactgattgaagagacgcagggcctggaggaggaccaagatgaggaacctgacaat tctcttcactgtgggaacagaaaattataa >gi568815594r:169891381_170105877|GENSCAN_predicted_peptide_9|189_aa MERQRNGPSGQANGMQGHWGVMISKCKAEYSHSGCHRWAPTALGSSTYVALKGIAPLLAA FTGRCWVSVAFSGAGVECLLSQAGVEGLRLFQAWNILNGYQGHHVEQCFLDLVSSWHLDL AQCYRITCFISWSSCFNCAQEMAGFLRENRHMSLRIFTACTYDYHPGYEERLHMLQGTGA QISIMTSMX >gi568815594r:169891381_170105877|GENSCAN_predicted_CDS_9|567_bp atggagaggcagagaaatggcccatcagggcaggctaatggaatgcaggggcactggggg gtcatgatcagcaaatgcaaggcagaatatagtcacagtggatgtcacagatgggctccc acagccttgggcagctccacctatgtggctttgaagggtatagccccgcttctggctgct ttcacaggccggtgttgggtgtctgtggctttttcaggtgctggtgttgagtgtctgctt tcacaggctggtgttgagggtctgcggctcttccaggcttggaatattctcaatggctat caaggccaccatgtggagcagtgcttcctggacctggtttcttcttggcatctggacctg gcccagtgctacaggatcacctgcttcatctcctggagctcctgcttcaactgtgcccag gaaatggctgggttcctgagggagaatagacacatgagcttgcgcatcttcactgcctgc acctatgattaccacccaggatatgaggagaggctgcacatgctgcagggaactggggcc caaatttccatcatgacctctatggnn