GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:07:58

Sequence gi568815593r:152292194_152505113 : 212920 bp : 38.93% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    520    599   80  0  2   74   87    94 0.851   8.48
 1.02 Intr +   3613   3689   77  1  2   31   99    55 0.608  -0.86
 1.03 Intr +  11729  11930  202  0  1   57   74   123 0.313   5.22
 1.04 Intr +  14163  14218   56  0  2   52   72    75 0.294   0.00
 1.05 Intr +  17990  18111  122  0  2   83   56    72 0.234   2.79
 1.06 Term +  18505  18603   99  0  0   75   44   121 0.304   3.55
 1.07 PlyA +  22007  22012    6                               1.05

 2.04 PlyA -  25526  25521    6                               1.05
 2.03 Term -  27209  27040  170  0  2   63   49   103 0.192   0.96
 2.02 Intr -  46667  46558  110  1  2   93   27    84 0.075   1.81
 2.01 Init -  50589  48488 2102  0  2   70   53   729 0.160  55.14
 2.00 Prom -  50663  50624   40                             -15.32

 3.02 PlyA -  50698  50693    6                               1.05
 3.01 Sngl -  51579  50854  726  0  0   44   36   294 0.751  15.77
 3.00 Prom -  51837  51798   40                              -5.25

 4.02 PlyA -  51919  51914    6                              -4.04
 4.01 Sngl -  52953  51937 1017  0  0   88   43   704 0.973  62.67
 4.00 Prom -  56529  56490   40                              -7.35

 5.07 PlyA -  58082  58077    6                               1.05
 5.06 Term -  66466  66312  155  2  2  -15   55   161 0.021  -0.40
 5.05 Intr -  71667  71404  264  1  0   47   80   158 0.020   7.46
 5.04 Intr - 100308 100096  213  1  0   82   29   111 0.512   2.36
 5.03 Intr - 103391 103266  126  0  0   67   98    91 0.805   7.73
 5.02 Intr - 105951 105867   85  0  1  101   75    35 0.511   1.97
 5.01 Init - 112911 112195  717  0  0   57   70   781 0.664  68.10
 5.00 Prom - 117938 117899   40                              -5.05

 6.07 PlyA - 118723 118718    6                               1.05
 6.06 Term - 131649 131385  265  2  1   41   41   118 0.162  -3.60
 6.05 Intr - 136244 136164   81  1  0   74   75   131 0.617   8.23
 6.04 Intr - 142108 141976  133  2  1   30   81    97 0.419   1.98
 6.03 Intr - 144372 144267  106  0  1   26   46    93 0.122  -2.23
 6.02 Intr - 157284 157130  155  1  2   81   95    70 0.380   5.87
 6.01 Init - 159363 159294   70  0  1   63  101    29 0.526   2.96
 6.00 Prom - 163531 163492   40                              -7.65

 7.03 PlyA - 164816 164811    6                               1.05
 7.02 Term - 170146 169945  202  0  1   91   42   139 0.530   5.48
 7.01 Init - 192603 192536   68  0  2   62   75    70 0.165   3.70

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  65164  65372  209  0  2   76   74   148 0.863   9.15
S.002 Term +  71987  72155  169  2  1   21   46   186 0.863   4.07

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_1|211_aa
MDDLEGFKASVEEVTEDEVEIARELKFYSRGTTAESSSSASLASLSSFIIVIGWQLLHWN
CTDDDLFVVQILIQTPAQSGTFAMSIAGPPFFFPASAALRDFVPDPSAIDKSQKKRMEKN
WLYLIVSLFNVKNKCEATTAGCVTGCVAQAADSVGEISSQAYLGMKALRIHIWERKETKA
MDWYWSVAWGLGTPALNERIVEDVIPDGESE
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_1|636_bp
atggatgaccttgaggggttcaaggcttctgtggaggaagtcactgaagatgaggtggaa
atagcaagagaactaaaattctacagcagaggaacaactgcagaatcttcgtcatcagca
tcattagcatctttatcatcatttattattgttattggctggcaactcctccactggaac
tgtactgatgatgatctctttgtagtccagatactcatccaaactccagcccaatctgga
acatttgccatgagtattgctggccccccattcttcttccctgcctcagctgctctgagg
gattttgtgcctgatccatcagcaattgacaagagccagaagaaaaggatggaaaagaat
tggctttatctcatcgttagccttttcaacgtcaaaaacaaatgtgaagccaccactgct
ggctgtgtcacaggttgtgttgctcaggcagcagactctgtgggggagattagcagtcag
gcttatttaggcatgaaagctctcaggatccacatctgggaaaggaaagaaacaaaagcc
atggactggtactggtctgtggcctgggggttggggacccctgctctaaatgaacgaata
gtggaggacgtcattcctgatggggaatctgaatga
>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_2|793_aa
MDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NQIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKI
QQRFMLKTLIKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADNMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL
LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI
WNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIM
PHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLN
VRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIR
VNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAA
KKHMKKCSSSLAIREMQIKTTIRYHLTPVRMAIIKKSGNNRKADLGNCMLEEYTGCSRNP
EKEHSEKYLAHNKHNSKSPIPSSDQQPETDQKPGLGHRCGLYAGERNLVSFQEDTSYYCG
LVGAELKALSPQT
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_2|2382_bp
atggatacattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagtttaccaaccaaa
aagagtccaggaccagatgggttcacagctgaattctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag
aattttagaccaatatccttgatgaacattgacgcaaaaatcctcaataaaatactggca
aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt
caacaacgcttcatgctaaaaactctcattaaattaggtattgatgggacgtatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccgctcctattc
aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacaacatgattgtttatcta
gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaaggaaataaaagaggacacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatc
cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata
tggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctgga
ggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg
tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatg
ccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaag
gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa
ctggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaac
gttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacata
ggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaatt
gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga
gtgaacaggcaacctacaacatgggagaaaattttcgcaacctactcatctgacaaaggg
ctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaacccc
atcaaaaagtgggtgaaggacatgaacagacacttctcaaaagaagacatttatgcagcc
aaaaaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaacc
actattagatatcatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaac
aggaaagcagatcttggcaactgtatgttggaagaatacacaggatgttctaggaaccca
gagaaggaacactcagaaaaatacttagcacacaataagcacaatagcaaatcccccatc
ccctcttccgaccagcagcctgaaacagaccagaaacctggtctagggcacaggtgtgga
ttatatgcaggagaaagaaatcttgttagctttcaggaggatacaagctattactgtggt
cttgtgggtgctgagctcaaagcattaagtccgcaaacataa
>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_3|241_aa
MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR
QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK
RTQIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE
NKDTTYQNLCDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTYSKASKGK
K
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_3|726_bp
atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat
acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc
cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga
cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcggacctaata
gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac
cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa
agaacacaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg
attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat
gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag
aacaaagacaccacataccagaatctctgtgacgcattcaaagcagtgtgtagagggaaa
tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca
tcacaattaaaagaactagaaaagcaagagcaaacatattcaaaagctagcaaaggcaag
aaataa
>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_4|338_aa
MGKKQNRKTGNSKTQSTSPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLHLIGVPESDVENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQSFSSRRATPRHIIVRFTKVEMKEKILRAAREKGRV
TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK
QMLRDFVTTRPALKELLKEVLNMERKNRYQPLQNHAKM
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_4|1017_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcacctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctg
attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccacaaagtttctcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatattaagggcagccagagagaaaggtcgggtt
accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagtg
ctaaacatggaaaggaaaaaccggtaccagccactgcaaaatcatgccaaaatgtaa
>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_5|519_aa
MEKLQNASWIYQQKLEDPFQKHLNSTEEYLAFLCGPRRSHFFLPVSVVYVPIFVVGVIGN
VLVCLVILQHQAMKTPTNYYLFSLAVSDLLVLLLGMPLEVYEMWRNYPFLFGPVGCYFKT
ALFETVCFASILSITTVSVERYVAILHPFRAKLQSTRRRALRILGIVWGFSVLFSLPNTS
IHGIKFHYFPNGSLVPGSATCTVIKPMWIYNFIIQVTSFLFYLLPMTVISVLYYLMALRL
KKDKSLEADEGNANIQRPCRKSVNKMLFVLVLVFAICWAPFHIDRLFFSFVEEWSESLAA
VFNLVHVVSGVFFYLSSAVNPIIYNLLSRRFQAAFQNVISSFHKQWHSQHDPQLPPAQRN
IFLTECHFVELTEDIGPQFPFYFMDASDVGSSKGDGERMMKYQGGQHELLTSLSSFAANI
PATVNLVHLTSLVLHSPLPITLTTAYNIVIYHLDNCNNNLNRTPKPSLTRYLSAVTVAAK
AGYKQSIETGREARQLVRPEILRREYALTLKRPRRAMAR
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_5|1560_bp
atggaaaaacttcagaatgcttcctggatctaccagcagaaactagaagatccattccag
aaacacctgaacagcaccgaggagtatctggccttcctctgcggacctcggcgcagccac
ttcttcctccccgtgtctgtggtgtatgtgccaatttttgtggtgggggtcattggcaat
gtcctggtgtgcctggtgattctgcagcaccaggctatgaagacgcccaccaactactac
ctcttcagcctggcggtctctgacctcctggtcctgctccttggaatgcccctggaggtc
tatgagatgtggcgcaactaccctttcttgttcgggcccgtgggctgctacttcaagacg
gccctctttgagaccgtgtgcttcgcctccatcctcagcatcaccaccgtcagcgtggag
cgctacgtggccatcctacacccgttccgcgccaaactgcagagcacccggcgccgggcc
ctcaggatcctcggcatcgtctggggcttctccgtgctcttctccctgcccaacaccagc
atccatggcatcaagttccactacttccccaatgggtccctggtcccaggttcggccacc
tgtacggtcatcaagcccatgtggatctacaatttcatcatccaggtcacctccttccta
ttctacctcctccccatgactgtcatcagtgtcctctactacctcatggcactcagacta
aagaaagacaaatctcttgaggcagatgaagggaatgcaaatattcaaagaccctgcaga
aaatcagtcaacaagatgctgtttgtcttggtcttagtgtttgctatctgttgggccccg
ttccacattgaccgactcttcttcagctttgtggaggagtggagtgaatccctggctgct
gtgttcaacctcgtccatgtggtgtcaggtgtcttcttctacctgagctcagctgtcaac
cccattatctataacctactgtctcgccgcttccaggcagcattccagaatgtgatctct
tctttccacaaacagtggcactcccagcatgacccacagttgccacctgcccagcggaac
atcttcctgacagaatgccactttgtggagctgaccgaagatataggtccccaattccca
ttctattttatggatgccagtgatgtaggcagtagtaagggtgatggggaaaggatgatg
aagtaccaaggaggtcaacatgaactgttgacttctctttcctcatttgctgcaaatatc
ccagcaacagttaatcttgttcatttaacctccttggtacttcatagtcctctccccatc
acccttactactgcctacaacattgtcatctatcacttagataactgcaacaacaatctt
aacagaacccccaagcccagtctgacaaggtatttatcagcagtaacagttgcagcaaaa
gctggttacaaacaatccatagaaacaggacgtgaagctagacaactggttagaccagaa
attctcagaagggagtatgccttaaccctaaagaggcccagaagagccatggcaagatga
>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_6|269_aa
MQEYMGTLYYLLNFPVNQKLPLKYLRSIISSSKKHCLTQIEMTNSELVKVLKKQMPRDLL
EEMPVSVLEKISHCDVFVEIKLGAQEVKSLAKSSRSLEAAVLTPCPVLLSELEAIILSEI
IHKQKVKYVLTYKWELNNEYMEVQSGIMVIGDLKRAERDLKDDRPNINFTDDDTEGESTE
KSSLLMDFLIAAFLPEQHKSLLQDFYNSIQLTTLSLNPNSKYLMALQLAAIIFPQYILPA
GLEFFKARSPFLFLSSSHNPIQSLQKLHI
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_6|810_bp
atgcaggagtatatgggaactctgtactaccttctcaatttccctgtcaatcaaaaactg
cccttaaaatatctcagatcaattatctcctcctccaagaaacattgtctgacacaaatt
gaaatgaccaattctgagttagttaaagtcctcaagaagcagatgccaagagatttattg
gaagaaatgcctgtgagtgtgctggagaaaatatcacactgtgatgtttttgttgagata
aagctaggggctcaagaagtaaagagccttgctaagtcatctagatccttagaagctgct
gttctaactccctgtccagtgcttctcagtgaactggaggccattatcctcagtgaaata
attcacaaacagaaagtcaaatatgttctcacttataagtgggagctaaataatgagtac
atggaagtacaaagtggaataatggtcattggagacctcaaaagagcagaaagggacctt
aaagatgacagaccaaacatcaacttcacagatgatgacacagaaggtgaatctactgaa
aagagctctctgctgatggatttccttattgctgccttcctgcctgaacaacacaagtca
ctgctccaggatttctataactctatacaacttactactctttccctgaaccctaacagc
aaatacttgatggcacttcagcttgctgctatcatcttcccacagtatattctacctgct
ggactagaatttttcaaagcacgttcaccatttctgtttctatccagtagtcataatccc
atccagtcacttcagaagctgcacatctaa
>gi568815593r:152292194_152505113|GENSCAN_predicted_peptide_7|89_aa
MMTQCDQCYHKGTNFKVVVVEERERCDFFGEACGASALCSHSTLYFPIMTLNAFCYGGQE
NSPLRSLKEGAQFNGLSTEAISSPAAPSL
>gi568815593r:152292194_152505113|GENSCAN_predicted_CDS_7|270_bp
atgatgacacagtgtgatcaatgctatcacaagggaactaacttcaaggtggtagtggtg
gaagagagagagagatgcgacttctttggggaagcctgtggtgcttctgctttgtgttct
catagcacgttgtacttccctatcatgacactcaatgcattttgttatggtggccaggaa
aattcacctctccggtctctgaaggagggggcacaattcaatggcctgagcactgaagcc
atatcttcaccagctgctcccagcctatga