GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:29:02 Sequence gi568815588r:70045420_70246586 : 201167 bp : 44.82% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1686 1725 40 -2.46 1.01 Init + 1810 1942 133 0 1 78 47 82 0.586 3.40 1.02 Intr + 2486 2542 57 0 0 82 79 36 0.449 0.96 1.03 Term + 7429 7856 428 2 2 55 54 128 0.252 1.57 1.04 PlyA + 8417 8422 6 1.05 2.00 Prom + 28120 28159 40 -3.16 2.01 Init + 30240 30411 172 2 1 76 95 187 0.586 17.80 2.02 Intr + 44641 44747 107 2 2 83 91 209 0.992 20.63 2.03 Intr + 46338 46535 198 2 0 77 102 227 0.999 22.55 2.04 Intr + 47882 48021 140 1 2 9 46 91 0.765 -3.74 2.05 Intr + 48183 48426 244 0 1 98 87 110 0.848 9.40 2.06 Intr + 50235 50334 100 2 1 86 82 125 0.983 11.38 2.07 Intr + 54789 54878 90 1 0 104 115 56 0.994 9.87 2.08 Intr + 62322 62399 78 1 0 100 24 77 0.425 2.02 2.09 Intr + 63614 63788 175 0 1 65 110 212 0.983 20.10 2.10 Term + 66099 66264 166 0 1 65 42 410 0.997 31.49 2.11 PlyA + 66834 66839 6 1.05 3.10 PlyA - 66875 66870 6 1.05 3.09 Term - 68910 68759 152 1 2 110 45 151 0.999 11.07 3.08 Intr - 69701 69501 201 0 0 60 94 435 0.990 40.56 3.07 Intr - 71355 71203 153 1 0 94 85 318 0.967 32.14 3.06 Intr - 72501 72351 151 0 1 135 85 126 0.832 16.74 3.05 Intr - 75180 75088 93 0 0 107 83 77 0.979 9.26 3.04 Intr - 75792 75673 120 0 0 97 99 102 0.684 12.89 3.03 Intr - 77457 77347 111 0 0 38 99 35 0.526 0.18 3.02 Intr - 78101 77986 116 0 2 138 105 139 0.999 20.77 3.01 Init - 78665 78488 178 2 1 57 95 301 0.995 25.12 3.00 Prom - 83956 83917 40 -6.66 4.05 PlyA - 89031 89026 6 1.05 4.04 Term - 94722 94505 218 1 2 88 54 217 0.819 15.61 4.03 Intr - 97434 97249 186 1 0 128 19 141 0.859 10.86 4.02 Intr - 98553 98423 131 2 2 88 98 160 0.999 17.34 4.01 Init - 101143 100002 1142 1 2 72 82 1218 0.490 111.08 4.00 Prom - 104991 104952 40 -6.56 5.04 PlyA - 105608 105603 6 1.05 5.03 Term - 107173 107057 117 1 0 86 41 173 0.999 10.94 5.02 Intr - 108550 108419 132 1 0 88 116 89 0.968 12.54 5.01 Init - 109242 109240 3 0 0 98 53 0 0.696 -2.50 5.00 Prom - 113321 113282 40 -1.26 6.00 Prom + 113545 113584 40 -2.06 6.01 Sngl + 118417 118635 219 0 0 60 36 230 0.959 10.16 6.02 PlyA + 120376 120381 6 1.05 7.00 Prom + 121152 121191 40 -5.06 7.01 Sngl + 124688 125416 729 1 0 93 44 200 0.871 12.33 7.02 PlyA + 126150 126155 6 1.05 8.13 PlyA - 127123 127118 6 1.05 8.12 Term - 129682 129530 153 1 0 56 42 107 0.609 0.82 8.11 Intr - 148130 148005 126 2 0 49 79 51 0.120 1.18 8.10 Intr - 160914 160845 70 2 1 85 34 68 0.265 0.28 8.09 Intr - 163871 163786 86 2 2 56 111 48 0.938 2.52 8.08 Intr - 164266 164139 128 2 2 117 85 167 0.999 19.70 8.07 Intr - 168170 168044 127 2 1 74 95 147 0.999 14.25 8.06 Intr - 169167 169081 87 0 0 61 90 59 0.921 3.57 8.05 Intr - 172512 172393 120 0 0 45 91 104 0.905 7.09 8.04 Intr - 173398 173345 54 1 0 136 75 4 0.659 3.08 8.03 Intr - 184980 184922 59 1 2 95 116 13 0.718 3.40 8.02 Intr - 188006 187845 162 0 0 53 70 178 0.681 12.45 8.01 Init - 193405 193330 76 1 1 68 78 78 0.549 4.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 31435 31619 185 2 2 115 42 77 0.819 3.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_1|205_aa MEYYAAIKKDEFMSFIGTWMKLETIILSKLSQGQKTKHRMFSLIENNQRLEINPMGLYLP NQNKNCRLPPLTRTDRTQRKAKSPARHRRRVGQTCARGGARRGHRAPGAGGAAPGEPAMA MALLFRVLRGLETKRAAGAVQGPERTTSASPPSPCGRTLDSRPTLCKNKFLNPAAQLASL GDKGGAAKEVVKFLRPARGRGRRGY >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_1|618_bp atggaatactatgcagccataaaaaaggatgagttcatgtccttcatagggacatggatg aagctagaaaccatcattctgagcaaactatcacaaggacagaaaaccaaacaccgcatg ttctcactcatagaaaataatcaaaggcttgagatcaaccccatgggcctatatctaccc aatcagaacaagaactgccgcttgccgccattgacacgcacagatagaacccaaagaaag gcaaagagtcctgcccggcaccggcgccgcgtgggccaaacctgcgcccgtggaggggcg cgcagagggcaccgggcgccgggagcaggcggcgcagcaccaggtgagcccgccatggcg atggcattattgtttagggtgctccggggtctggaaacaaaacgcgccgccggggcagtg caggggccggagagaaccacctctgcctctcccccgagcccctgcgggcgcaccctcgat agccgacccaccctctgcaaaaacaaattcttaaatccagctgcccaacttgcttccttg ggcgataaagggggggctgctaaagaagtggtcaaattcctgcggccagcgaggggccgg gggcgtcgagggtactga >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_2|489_aa MSGRSGKKKMSKLSRSARAGVIFPVGRLMRYLKKGTFKYRISVGAPVYMAAVIEYLAAEI LELAGNAARDNKKARIAPRHILLAVANDEELNQLLKGVTIASGGVLPRIHPELLAKKRGT KGKSETILSPPPEKRGRKATSGKKGGKKSKAAKPRTSKKKVLLDNPTVEVVDVSFHTASV PREILGRKKAKQWAINGPTQLRCLSSNSRTHHLTTSFHIQLGGSREKSLIKAPQAFSGLV LILPFDFYRQSKPKDSDKEGTSNSTSEDGPGDGFTILSSKSLVLGQKLSLTQSDISHIGS MRVEGIVHPTTAEIDLKEDIGKALEKAGGKEFLETVKELRKSQGPLEVAEGHVEIIKWQS LSRNYTIKFSGSQLIEAAVSQSSGLAAKFVIHCHIPQWGSDKCEEQLEETIKNCLSAAED KKLKSVAFPPFPSGRNCFPKQTAAQVTLKAISAHFDDSSASSLKNVYFLLFDSESIGIYV QEMAKLDAK >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_2|1470_bp atgtcgggccggagtgggaagaagaaaatgtccaagctgtcccgttcagctagggcaggt gtcatctttccagtggggaggctgatgcgttatctgaagaaagggacgttcaagtaccgg atcagcgtgggcgcccctgtctacatggcggcagtcattgagtacctggcagcggaaatt ctagaattggccggcaatgccgcgagggacaacaagaaggcccggatagccccgagacac atcttgctggcagttgccaatgacgaggagctcaaccagctgctaaaaggagtgaccatc gccagtggaggcgtcctgcccagaattcaccccgaactgctggccaaaaagcgagggacc aaaggcaagtcggaaacgatcctctccccacccccagagaaaagaggcaggaaggccacg tcaggcaagaagggggggaagaaatccaaggctgccaaaccacggacgtccaaaaagaaa gttctattggacaaccctactgtagaagttgtggatgtttcattccacactgcctctgtg ccccgagaaatcctggggagaaagaaggctaagcaatgggctatcaacggccccacccag ctccgatgcttgtcgtcaaacagcagaacacatcacctaaccacttcctttcatattcaa cttggtggcagccgtgagaagagcttaataaaggcacctcaggctttttctggattggtt ctcatcttgccttttgatttttaccgacagtccaaaccaaaggacagcgataaagaagga acttcaaattccacctctgaagatgggccaggggatggattcaccattctgtcttctaag agccttgttctgggacagaagctgtccttaacccagagtgacatcagccatattggctcc atgagagtggagggcattgtccacccaaccacagccgaaattgacctcaaagaagatata ggtaaagccttggaaaaggctgggggaaaagagttcttggaaacggtaaaggagcttcgc aaatcccaaggccctttggaagtcgccgaagggcatgtggagatcatcaaatggcagtcc ctgtcaaggaactacacaatcaagttcagtggttctcaactcatagaagccgccgtcagc caatccagtggactcgcagccaaatttgtcatccactgtcacatccctcagtggggctcc gacaaatgtgaagaacagcttgaagagaccatcaaaaactgcctgtcagcggcggaggac aagaagctaaagtccgtcgcgttcccgcctttccccagcggcagaaactgctttcccaaa cagactgcggcccaggtgaccctcaaagccatctcagcccactttgatgactcgagcgcg tcctcgctgaagaacgtgtacttcctgctcttcgacagcgagagcatcggcatctacgtg caggagatggccaagctcgacgccaagtag >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_3|424_aa MGSQVSVESGALHVVIVGGGFGGIAAASQLQALNVPFMLVDMKDSFHHNVAALRASVETG FAKKTFISYSVTFKDNFRQGLVVGIDLKNQMVLLQGGEPLRKSHECLPCLYGEGGITSFY REAKGLNGLLKIPQLALPFSHLILATGSTGPFPGKFNEVSSQQAAIQAYEDMVRQVQRSR FIVVVGGGSAGVEMAAEIKTEYPEKEVTLIHSQVALADKELLPSVRQEVKEILLRKGVQL LLSTCTLTSLPALPWPGERVSNLEELPLNEYREYIKVQTDKGTEVATNLVILCTGIKINS SAYRKAFESRLASSGALRVNEHLQVEGHSNVYAIGDCADVRTPKMAYLAGLHANIAVANI VNSVKQRPLQAYKPGALTFLLSMGRNDGVGQISGFYVGRLMVRLTKSRDLFVSTSWKTMR QSPP >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_3|1275_bp atggggtcccaggtctcggtggaatcgggagctctgcacgtggtgattgtgggtgggggc tttggcgggatcgcagcagccagccagctgcaggccctgaacgtccccttcatgctggtg gacatgaaggactccttccaccacaatgtggctgctctccgagcctccgtggagacaggg ttcgccaaaaagacattcatttcttactcggtgactttcaaggacaacttccggcagggg ctagtagtggggatagacctgaagaaccagatggtgctgctgcagggtggcgagccttta cgtaagtctcatgaatgtttaccgtgcctctatggcgaaggtggtataacttcattttac agagaagctaaagggttaaatggcttgcttaagatcccacagctggccctgcccttctct catcttatcctggccacgggcagcactgggcccttcccgggcaagtttaatgaggtttcc agccagcaggccgctatccaggcctatgaggacatggtgaggcaggtccagcgctcacgg ttcatcgtggtggtgggaggaggctcggctggagtggagatggcagcagagattaaaaca gaatatcctgagaaagaggtcactctcattcactcccaagtggccctggctgacaaggag ctcctgccctccgtccggcaggaagtgaaggagatcctcctccggaagggcgtgcagctg ctgctgagtacgtgcaccctcacctccctccctgccctgccctggcctggtgagcgggtg agcaatctggaggagctgcctctcaatgagtatcgagagtacatcaaagtgcagacggac aaaggcacagaggtggccaccaacctggtgattctctgcaccggcatcaagatcaacagc tccgcctaccgcaaagcgtttgagagcagactagccagcagtggtgctctgagagtgaac gagcacctccaggtggagggccacagcaacgtctacgccattggtgactgtgccgacgtg aggacgcccaagatggcctatcttgccggcctccacgccaacatcgccgtggccaacatc gtcaactctgtgaagcagcggcctctccaggcctacaagccgggtgcactgacgttcctc ctgtccatggggagaaatgacggtgtgggccaaatcagtggcttctatgtgggccggctc atggttcggctgaccaagagccgggacctgttcgtctctacgagctggaaaaccatgagg cagtctccaccttga >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_4|558_aa MRAAEQAGCMVSASRAGQPEAGPWSCSGVILSRSPGLVLCHGGIFVPFLRAGSEVLTAAG AVFLPGDSCRDDLRLHVQWAPTAAGPGGGAERGRPGLCTPQCASLEPGPPAPSRGRPLQP RLPAELLLLLSCPAFWAHFARLFGDEAAEQWRFSSAARDDEVSEDEEADQLRALGWFALL GVRLGQEEVEEERGPAMAVSPLGAVPKGAPLLVCGSPFGAFCPDIFLNTLSCGVLSNVAG PLLLTDARCLPGTEGGGVFTARPAGALVALVVAPLCWKAGEWVGFTLLCAAAPLFRAARD ALHRLPHSTAALAALLPPEVGVPWGLPLRDSGPLWAAAAVLVECGTVWGSGVAVAPRLVV TCRHVSPREAARVLVRSTTPKSVAIWGRVVFATQETCPYDIAVVSLEEDLDDVPIPVPAE HFHEGEAVSVVGFGVFGQSCGPSVTSGILSAVVQVNGTPVMLQTTCAVHSGSSGGPLFSN HSGNLLGIITSNTRDNNTGATYPHLNFSIPITVLQPALQQYSQTQDLGGLRELDRAAEPV RVVWRLQRPLAEAPRSKL >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_4|1677_bp atgagggcggccgagcaggcgggctgcatggtgagcgcctcccgggccggacagcccgag gcgggcccgtggagctgcagcggggtaatcctgagccgtagcccgggcctggtgctttgc cacgggggcatcttcgtccccttcctgcgagctggcagcgaagtcctgaccgcggccggc gccgtcttcctgcctggcgacagttgcagggacgacctgcgcctgcacgtgcagtgggcc ccaacggccgcgggtcccgggggcggcgcggagcggggccgcccagggctgtgcacgccc cagtgcgcgagcctcgagcccggcccacctgccccgtcccgcgggcgtcccctgcagccc cggcttcctgctgagctgctgctgctgctgagctgcccggccttctgggcccacttcgcg cgcctcttcggggacgaggcagcggaacagtggcgcttctcgagcgcggcgcgggatgac gaagtgtcggaggacgaggaggcggatcaactgagagcgctgggctggtttgcgctgctg ggcgtgcggctaggccaggaggaggtggaggaggagcgcgggccagccatggcggtgtcg cctctcggggccgtgcccaagggtgcgccattgctggtctgcggctcccctttcggcgcc ttctgccccgacatctttctcaacacgctgagctgcggggtgctcagcaacgtggccggc ccactgctgcttaccgacgcacgctgcctgcccggcaccgagggcggcggcgtgttcacc gcgcggcccgcgggggcgctggtggcgctggtggtggcgccgctctgttggaaggccggc gaatgggtgggcttcacgctgctctgcgccgccgccccccttttccgcgccgcccgcgac gcgcttcaccgcctgccgcacagcaccgctgccctggccgcccttctgccgccagaggtg ggcgtcccgtggggtctgcccctccgagactccgggcccctgtgggcagccgcggcagtg ttggtggagtgcggcaccgtatggggctccggagtggctgtggcaccccgccttgtagtg acctgtcggcacgtgtcccctcgggaagcagccagggtcctggtgcgctccaccaccccc aagagtgtggccatctggggccgtgtggtatttgccactcaggagacatgtccctatgac atagcagtggtgagcctggaggaggacctggatgatgtccccatccctgtgcccgctgag cacttccatgaaggcgaggctgtgagtgtggtgggctttggcgtctttggccagtcttgc gggccctcggtgacctcaggcatcctttcggctgtggtgcaggtgaatggcacgcccgta atgctgcagaccacgtgtgctgtgcacagcggctccagtgggggacccctcttctccaac cactcaggaaacctccttggcataatcaccagcaacacccgggacaataatacgggggcc acctacccccacctgaacttcagcattcccatcacggtgctccagccggccctgcagcag tacagccagacccaagacctaggtggcctccgtgagctggaccgcgctgctgagccagtc agggtggtgtggcggttgcagcggcccctggcagaggccccgcggagcaagctctga >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_5|83_aa MALMTDETISNVPILILGNKIDRTDAISEEKLREIFGLYGQTTGKGNVTLKELNARPMEV FMCSVLKRQGYGEGFRWLSQYID >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_5|252_bp atggctttaatgactgatgaaacaatatccaatgtgccaatccttatcttgggtaacaaa attgacagaacagatgcaatcagtgaagaaaaactccgtgagatatttgggctttatgga cagaccacaggaaaggggaatgtgaccctgaaggagctgaatgctcgccccatggaagtg ttcatgtgcagtgtgctcaagaggcaaggttacggcgagggtttccgctggctctcccag tatattgactga >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_6|72_aa MINEVDADGNGTIDFPEFLTMMARKMKDTDSEEEIRETFCVFDKDGNGYISGVELHHVMT NLGVKLTDEEVD >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_6|219_bp atgattaatgaagtagatgctgatggtaatggcacaattgacttccctgaatttctgaca atgatggcaagaaaaatgaaagacacagacagtgaagaagaaattagagaaacattctgt gtgtttgataaggatggcaatggctatattagtggtgtagaacttcaccatgtgatgaca aaccttggagtgaagttaacagatgaagaagttgattaa >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_7|242_aa MEPFRKPERTVQTRNCYPAQQLRDPRAECKWTPGKEDSRGRSRSGQVVPSPPRISPAAPQ NRLRVTSPPPNPARASLRSDLPAFRPSPSSPNSPPRRDPQTHGLRGSSGKTAAGSDPPSE HTNQQHPGPPATRRMYVTPLPGTSGRVMGPAPSQRYASLPRCRPTSDSGRGGAGIRRDKA RSGSGPASAPPAPVWLADRARRGSAGTLGPGLGPGVPERPGTLGIAASHSRRTRKVRSGY RR >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_7|729_bp atggagccatttcgtaaaccggagcgcacggtacagacaagaaactgttatccggcccag cagctgcgagacccccgagcggaatgcaagtggaccccaggcaaagaggacagcagggga cggagcagatctggccaagttgtcccgtccccgcccagaatcagccccgcggccccacag aaccgcctgcgtgtcacttcccccccacccaaccccgccagggccagcctccgctccgac ctccccgccttccgtcccagcccctcgagccccaactcccctccccgacgcgaccctcag actcacggcctgaggggctcctccggcaaaacagcggctggctcggaccctccctcagag cacactaaccagcagcacccgggaccgccagctactcgccggatgtacgtcacacccctc cccgggacttccgggcgcgtaatgggccccgccccctcacagcgttacgcctctctgccc cggtgccgtcccaccagcgactcgggccgcggaggggcgggcataaggcgtgacaaagcg cgctcggggtctggccccgcctcggccccgcctgctcccgtctggctagctgaccgcgcg agacgtggcagcgccggaaccctgggtccggggctgggtcctggagtccctgagcggcct ggtacactcgggatcgcggcttcccattccagacgcaccaggaaagtcagaagtgggtac cgacggtga >gi568815588r:70045420_70246586|GENSCAN_predicted_peptide_8|415_aa MGNASARPPRLGGEERLCPAALHLGVLAVRRRGALSLSVGAACGLVALWQRRRQDSGTMS GFSTEERAAPFSLEYRVFLKNEKGQYISPFHDIPIYADKDVFHMVVEVPRWSNAKMEIAT KDPLNPIKQDVKKGKLRYVANLFPYKGYIWNYGAIPQTWEDPGHNDKHTGCCGDNDPIDV CEIGSKVCARGEIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYNDINDVKRLKPGY LEATVDWFRRYKVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGISC MNTTLSESPFKCDPDAARAIVDAPLGGRKWWSLALVIRCFCPEVTHITDISLARASRMTM PDLKVAVRIPHLHCIKPKKKAEGDAKEVKAKVKDKPRRRSAKLSLKLLLHTRAKA >gi568815588r:70045420_70246586|GENSCAN_predicted_CDS_8|1248_bp atggggaacgcctctgcccggccgccccgtctgggaggtgaggagcgcctctgcccggcc gcccttcatctgggagtgctcgcagtgcgcaggcgtggggctctctccttgtcagtcggc gccgcgtgcgggctggtggctctgtggcagcggcggcggcaggactccggcactatgagc ggcttcagcaccgaggagcgcgccgcgcccttctccctggagtaccgagtcttcctcaaa aatgagaaaggacaatatatatctccatttcatgatattccaatttatgcagataaggat gtgtttcacatggtagttgaagtaccacgctggtctaatgcaaaaatggagattgctaca aaggaccctttaaaccctattaaacaagatgtgaaaaaaggaaaacttcgctatgttgcg aatttgttcccgtataaaggatatatctggaactatggtgccatccctcagacttgggaa gacccagggcacaatgataaacatactggctgttgtggtgacaatgacccaattgatgtg tgtgaaattggaagcaaggtatgtgcaagaggtgaaataattggcgtgaaagttctaggc atattggctatgattgacgaaggggaaaccgactggaaagtcattgccattaatgtggat gatcctgatgcagccaattataatgatatcaatgatgtcaaacggctgaaacctggctac ttagaagctactgtggactggtttagaaggtataaggttcctgatggaaaaccagaaaat gagtttgcgtttaatgcagaatttaaagataaggactttgccattgatattattaaaagc actcatgaccattggaaagcattagtgactaagaaaacgaatggaaaaggaatcagttgc atgaatacaactttgtctgagagccccttcaagtgtgatcctgatgctgccagagccatt gtggatgctcctctcggaggaagaaagtggtggagccttgcactggtgattaggtgcttc tgcccagaagtgacgcacatcacagacatttcactggctagagcaagccgaatgaccatg cctgacctcaaggtggcagtgcgtattccccacctccactgcatcaagcccaagaaaaag gctgaaggggatgctaaggaagttaaagccaaggtgaaggacaaaccacggagaagatct gcaaagttatcactaaaactgctcctccacaccagagccaaagcctaa