GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:59:34 Sequence gi568815583r:34690415_34894808 : 204394 bp : 40.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3729 3933 205 2 1 96 56 98 0.004 6.46 1.02 Intr + 15786 15925 140 1 2 109 105 -5 0.052 2.56 1.03 Intr + 18504 18555 52 2 1 73 96 37 0.076 0.56 1.04 Intr + 29247 29431 185 1 2 59 70 133 0.231 7.19 1.05 Term + 31631 31780 150 1 0 46 48 89 0.136 -2.37 1.06 PlyA + 32029 32034 6 1.05 2.00 Prom + 34057 34096 40 -5.55 2.01 Sngl + 42207 42497 291 2 0 54 48 264 0.556 14.40 2.02 PlyA + 42736 42741 6 1.05 3.00 Prom + 54286 54325 40 -4.95 3.01 Init + 55477 55621 145 0 1 74 71 82 0.424 5.46 3.02 Term + 59657 59757 101 0 2 47 42 114 0.255 0.01 3.03 PlyA + 59990 59995 6 1.05 4.05 PlyA - 60493 60488 6 1.05 4.04 Term - 62958 62064 895 2 1 124 54 834 0.956 74.48 4.03 Intr - 64236 64004 233 0 2 68 94 57 0.172 -0.05 4.02 Intr - 64531 64322 210 1 0 58 75 119 0.351 5.79 4.01 Init - 73455 73387 69 0 0 50 79 81 0.236 4.50 4.00 Prom - 90835 90796 40 -3.85 5.15 PlyA - 90919 90914 6 1.05 5.14 Term - 100141 99998 144 1 0 77 44 93 0.880 0.73 5.13 Intr - 100881 100700 182 1 2 72 98 235 0.985 21.57 5.12 Intr - 101867 101676 192 0 0 83 97 295 0.997 28.44 5.11 Intr - 102155 101994 162 0 0 74 105 284 0.988 27.73 5.10 Intr - 103155 102831 325 0 1 97 85 367 0.999 31.72 5.09 Intr - 103538 103428 111 2 0 57 100 45 0.824 2.26 5.08 Intr - 104404 104266 139 0 1 5 97 196 0.969 11.75 5.07 Intr - 105666 105092 575 0 2 58 113 219 0.609 12.01 5.06 Intr - 109080 108971 110 1 2 62 82 88 0.688 4.68 5.05 Intr - 110395 110335 61 1 1 113 90 55 0.941 5.59 5.04 Intr - 129409 129149 261 1 0 117 74 110 0.556 9.26 5.03 Intr - 142817 142686 132 2 0 24 31 135 0.152 1.32 5.02 Intr - 148180 147903 278 2 2 -26 19 481 0.090 26.11 5.01 Init - 149396 149291 106 2 1 69 -12 85 0.342 -2.96 5.00 Prom - 154061 154022 40 -5.05 6.00 Prom + 155378 155417 40 -6.95 6.01 Sngl + 159508 159921 414 0 0 86 53 350 0.934 27.34 6.02 PlyA + 160255 160260 6 1.05 7.14 PlyA - 160301 160296 6 1.05 7.13 Term - 166692 166378 315 0 0 63 42 394 0.997 26.26 7.12 Intr - 169741 169628 114 1 0 121 109 -27 0.841 2.42 7.11 Intr - 172627 172453 175 0 1 53 97 113 0.988 7.72 7.10 Intr - 177195 177110 86 0 2 111 94 72 0.945 7.80 7.09 Intr - 180508 180338 171 1 0 64 75 135 0.996 9.02 7.08 Intr - 183585 183414 172 2 1 -4 110 135 0.999 5.52 7.07 Intr - 184450 184263 188 1 2 90 93 209 0.998 19.17 7.06 Intr - 185592 185521 72 0 0 71 110 33 0.848 2.48 7.05 Intr - 192225 192088 138 0 0 115 98 159 0.999 19.34 7.04 Intr - 194320 194111 210 1 0 103 94 116 0.999 11.89 7.03 Intr - 196247 196112 136 1 1 37 110 96 0.911 6.35 7.02 Intr - 199910 199801 110 2 2 87 99 190 0.995 18.16 7.01 Intr - 203359 203249 111 1 0 97 99 78 0.992 9.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 119066 119116 51 1 0 70 116 40 0.812 6.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_1|243_aa MDIETGKDFMTKTPKAIATKAKIDMWDLIKLKSFCTAKETINRVNRQPTKWEKRFANYAF DKGLISSICGMFTNRLLLPEELLSFSPLFLTTSTQQQNSKGQYLGTLLVIGEHLLDNAMP MQLTTGSEFCMSGAKVTAAQIRVTFQALTYCSALVPAEAPWGVFKGEPLADKVTCHLLHL GLLSRKCLPKVAQKAARADAGRRRRQRAGLALTQAASTLRSTPPCSLMRREELNPDSRSL LTP >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_1|732_bp atggacatagaaacgggcaaagatttcatgacaaagacaccaaaagcaattgcaacaaaa gcaaaaattgatatgtgggatctaattaaactaaagagcttctgcacagcaaaagaaact atcaacagagtaaacagacaacctacaaaatgggagaaaagatttgcaaactatgcattt gacaaaggtctaatatccagcatctgtggcatgttcacaaacagactcctcctcccagag gaattgctgtccttttcccctcttttccttaccacctccactcaacagcagaacagtaag ggacagtaccttggcaccctgcttgtgataggtgaacatctcctggataatgcaatgcca atgcagttaacaacaggctctgaattctgtatgtcaggagctaaagtcacagcagctcag ataagggtcacgttccaagctcttacatactgctctgcccttgtccctgcagaggcccca tggggtgtcttcaaaggggagcctttggctgacaaggtgacttgtcacctgctgcatttg ggcctactcagcagaaagtgccttcccaaggtggcacagaaggcggcgcgagcagatgct gggcgccggcggaggcagcgtgcagggctggcgctcacgcaggccgccagcaccctgcgc tccaccccaccctgctccctaatgagaagagaggagctgaacccagacagccgctccctg ttgacgccttag >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_2|96_aa MSGLVGKSEKPGTSSWQSKAIKEELMERSKGGRAASLPLCGPRPKAVEGLAPLLVSKATG RENDPEAEQKNAKTSWNLPDISDSTHYHNDRTAFYE >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_2|291_bp atgtcaggacttgtagggaaatctgagaagccaggcacgtctagctggcagagtaaggcc ataaaggaggagctgatggagaggtctaagggaggccgtgctgcctccctgccactatgt gggccacggccaaaagctgtggagggcctggcgccattgttggtcagcaaggccacaggc agagaaaatgacccagaagcagagcagaagaatgcaaagaccagctggaacctgccagac atctctgattctacccattaccataatgaccgcacagccttctatgagtaa >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_3|81_aa MKWLLLQTLINTQTFHIQKIKTRSQLPGVYPASADLLRGLTGPQHLLPVEMQQSCLKINR PGNPTLLRNVQAKCSLELEGE >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_3|246_bp atgaagtggctcctgctgcagaccctgatcaacacccagacttttcatattcagaaaatc aaaaccaggtcacagcttccaggagtctatccagcttctgctgatctgctcagaggactg actggcccacaacatctgctcccagttgagatgcaacaatcgtgtttgaaaatcaatcgt ccaggaaaccccacgttgcttcgcaatgttcaagccaagtgctcattggaattagaaggt gagtag >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_4|468_aa MESTPGKDAVNIVEMTTKNLDYDAAAPERQVPAEVAGTEPVFGPSPGPQPAAESGAETSP GRASAGGPAGCHPAPPLVPASAGSRPEPRTAPEYNKRKGGFEDFLKNCFFNWEAIHSHSG SAARGGARSGNVPGAAQSLSAASGCTAMGEWTILERLLEAAVQQHSTMIGRILLTVVVIF RILIVAIVGETVYDDEQTMFVCNTLQPGCNQACYDRAFPISHIRYWVFQIIMVCTPSLCF ITYSVHQSAKQRERRYSTVFLALDRDPPESIGGPGGTGGGGSGGGKREDKKLQNAIVNGV LQNTENTSKETEPDCLEVKELTPHPSGLRTASKSKLRRQEGISRFYIIQVVFRNALEIGF LVGQYFLYGFSVPGLYECNRYPCIKEVECYVSRPTEKTVFLVFMFAVSGICVVLNLAELN HLGWRKIKLAVRGAQAKRKSIYEIRNKDLPRVSVPNFGRTQSSDSAYV >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_4|1407_bp atggaatctactcctggtaaagatgctgtgaacattgttgaaatgacaacaaagaattta gactacgacgctgccgcgccggagcgccaggtgcccgctgaagtagcaggaacagagccg gtgttcggaccctcccctggcccccaacctgcggcggagagcggcgctgagacttctcct gggcgggcgagcgctggaggacctgcaggctgccaccccgccccgcccctcgtcccggcg tccgcgggatccagacccgaaccccggacggcgcccgagtacaataaaaggaaaggggga ttcgaggattttttaaaaaattgcttctttaattgggaggcaattcactcccattctgga agtgcggcccggggaggggccaggagcgggaacgtgcccggtgctgcccagtctttgtct gctgcctccggatgcacagcgatgggggaatggaccatcttggagaggctgctagaagcc gcggtgcagcagcactccactatgatcgggaggatcctgttgactgtggtggtgatcttc cggatcctcattgtggccattgtgggggagacggtgtacgatgatgagcagaccatgttt gtgtgcaacaccctgcagcccggctgtaaccaggcctgctatgaccgcgccttccccatc tcccacatacgttactgggtcttccagatcataatggtgtgtacccccagtctttgcttc atcacctactctgtgcaccagtccgccaagcagcgagaacgccgctactctacagtcttc ctagccctggacagagacccccctgagtccataggaggtcctggaggaactgggggtggg ggcagtggtgggggcaaacgagaagataagaagttgcaaaatgctattgtgaatggggtg ctgcagaacacagagaacaccagtaaggagacagagccagattgtttagaggttaaggag ctgactccacacccatcaggtctacgcactgcatcaaaatccaagctcagaaggcaggaa ggcatctcccgcttctacattatccaagtggtgttccgaaatgccctggaaattgggttc ctggttggccaatattttctctatggctttagtgtcccagggttgtatgagtgtaaccgc tacccctgcatcaaggaggtggaatgttatgtgtcccggccaactgagaagactgtcttt ctagtgttcatgtttgctgtaagtggcatctgtgttgtgctcaacctggctgaactcaac cacctgggatggcgcaagatcaagctggctgtgcgaggggctcaggccaagagaaagtca atctatgagattcgtaacaaggacctgccaagggtcagtgttcccaattttggcaggact cagtccagtgactctgcctatgtgtga >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_5|925_aa MGKGSQGPVPTGQYYMLGVTASGVKENKKRLENVMGNHDGDHDGDDDGEGEGEGGDNDED GGGGEGDCDDDDDDGDGDGNDGDGDDGEGGDDDSDGVGSCDYDGDCGSDDDGGGDNNVMM VMVMVMIMVEESTLAPSAKNAARGTIYESMSESSIDTESAGVFILDFPTSRIVPWQSSNY APSVPVSILLFVVPKLDIVTIAMAPVALLSSWQLAQASNLLWFQSESLPLDNQIAQNLME ITNGNLHTQYNRNRLTLTGVRLRSDIVMVGLSRKDGFAPVCPARLPHGQSTPKRASLALK SPGDPPQLCHPTGIYKPLRCAAAVSFVLVAALEDEKPLLLPCCPGPAVRTPQQRAGRQAF PLHVAYCPQGWQGCGGPNPQTIQGAPTPQKGGGVGWRHLVFPCPLPFSACPSPAPYLAIP LTAPSPSLHGLGAPWLILSPALGSMNGLGSPSGCEGDQIRQGGRPGPPPLPPAAPTDPVH QRSIKRPSWSQPPRARCRRSRADPPRRRCAKMCDDEETTALVCDNGSGLVKAGFAGDDAP RAVFPSIVGRPRHQEKYNVSLIQSFQNAVIYIPVDMGDELNNLMKAYDIFKGVMVGMGQK DSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEEHPTLLTEAPL NPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYA LPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATA ASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIR KDLYANNVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLS TFQQMWISKQEYDEAGPSIVHRKCF >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_5|2778_bp atggggaaaggaagccagggaccagtcccaacagggcagtattacatgttgggtgtcaca gcatctggagtcaaggagaacaagaagagattggaaaatgtaatgggtaatcatgatggt gatcatgatggtgatgacgatggtgaaggtgagggtgagggtggtgataatgatgaggat ggtggtggcggtgagggtgattgtgatgatgatgatgatgatggtgatggagatggtaat gatggagatggtgatgatggtgagggtggtgatgatgacagtgatggtgttggtagttgt gattatgatggtgattgtggtagtgatgatgatggtggtggtgacaataatgtaatgatg gtgatggtgatggtgatgattatggttgaagagagcaccctagccccttcagccaagaat gcagcaaggggcaccatctatgaatctatgagtgagtcttctatagacactgaatcagct ggtgtcttcatcttggacttcccaacctccagaattgttccctggcagtcttctaactat gccccctctgtaccagttagcattctgctctttgttgtcccaaaactagacattgtgaca attgcaatggcacctgtggccctcctttcatcgtggcagctagctcaagccagcaactta ctttggttccaatcagaatctttgcctttagataatcaaattgcacaaaacctaatggaa ataaccaatggtaatctgcatacacaatacaataggaacaggcttacccttactggggtc aggcttagatcagacattgtcatggttgggctctccaggaaagatggatttgctccagtt tgccctgcacgtctccctcatggccagtccacgccgaaaagggcttcactggccctcaag agccctggggacccgccccagctctgccatcccactgggatctacaagccactcagatgt gctgctgcggtgtcctttgtgctggtggcagccctggaagatgagaagccgctgttgctc ccctgctgccctggcccagctgtcaggacccctcagcagagggcagggcgccaagccttc ccactgcatgtggcttattgtccccaaggctggcagggctgcggaggaccgaatccacag accatccagggagcacccacaccccagaaagggggaggggtgggctggcgtcacttagtc ttcccctgccccctacccttcagcgcctgcccctccccagctccctatttggccatcccc ctgactgccccctccccttccttacatggtctgggggctccctggctgatcctctcccct gcccttggctccatgaatggcctcggcagtcctagcgggtgcgaaggggaccaaataagg caaggtggcagaccgggccccccacccctgcccccggctgctccaactgaccctgtccat cagcgttctataaagcggccctcctggagccagccacccagagcccgctgccgccggagc cgagccgacccgccccgccgacgctgtgccaagatgtgtgacgacgaggagaccaccgcc ctggtgtgcgacaacggctctgggctggtgaaggccggctttgcgggcgatgacgcgccc cgcgctgtcttcccgtccatcgtgggccgcccgcggcaccaggagaaatacaatgtgtca ttaattcagtcattccaaaatgcagtcatctatattccagttgacatgggtgatgagctt aataacttaatgaaggcatatgatatttttaagggagttatggtgggtatgggtcagaag gactcctacgtaggtgatgaagcccagagcaagagaggcatcctgaccctgaagtatccc atcgagcatggtatcatcaccaactgggacgacatggagaagatctggcaccacaccttc tacaatgagctccgtgtggctcccgaggagcaccccaccctgctcacagaggccccgctg aaccccaaggccaaccgggagaagatgactcagatcatgtttgagaccttcaatgtccct gccatgtacgtggccatccaggcagtgctatccctgtatgcttctggccgtaccacaggc attgttctggactctggggatggtgtaactcacaatgtccccatctatgagggctacgct ttgccccatgccatcatgcgtctggatctggctggtcgggacctcactgactacctcatg aagatcctcactgagcgtggctactcctttgtcaccactgctgaacgtgaaattgtccgt gacattaaagagaagctgtgctatgtcgccctggattttgagaatgagatggccacagct gcctcttcctcctccctggagaagagctatgaactgcctgatggccaagtcatcactatt ggcaatgagcgcttccgctgtcctgagacactcttccagccctccttcattggtatggaa tctgctggcatccatgaaacaacttacaatagcatcatgaagtgtgacattgatatccgc aaggacctgtatgccaacaatgtcttatctggaggcaccactatgtaccctggtattgct gatcgtatgcagaaggaaatcactgctctggctcctagcaccatgaagattaagattatt gctccccctgagcgtaaatactctgtctggattgggggctccatcctggcctctctgtcc accttccagcaaatgtggattagcaagcaagagtacgatgaggcaggcccatccattgtc caccgcaaatgcttctaa >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_6|137_aa MKRQTFYQQGGEPRTSSPTKAMENQEALKTCALVGQLENLVQHEASDLLANSIVPLGIVI GSIFLACDELLRVEELVVDGNVNFVNECGLQVYKHCPGLMLASTCLTEDVKGVISPSGLV TWHLAIGLHAVFQAAEL >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_6|414_bp atgaagagacagacattctatcagcagggaggtgaacccagaaccagttccccaaccaaa gctatggaaaaccaagaagccctgaagacttgtgcactggttggccagcttgagaatttg gtccaacatgaggccagtgatctcctcgccaatagtatagtgcccttgggcatagttatt ggcagcatcttccttgcctgtgatgagctgctcagggtggaagagctagtggtagatggc aacgtgaattttgtcaatgaatgtgggctccaggtctacaaacactgccctgggcttatg cttgccagcacctgtctcaccgaagatgttaaaggagtcatctccccgagtggtcttgtc acttggcacctggccatcgggctgcatgccgtgttccaggccgcagagctctga >gi568815583r:34690415_34894808|GENSCAN_predicted_peptide_7|665_aa VVGPPGTGKTDVAVQIISNIYHNFPEQRTLIVTHSNQALNQLFEKIMALDIDERHLLRLG HGEEELETEKDFSRYGRVNYVLARRIELLEEVKRLQKSLGVPGDASYTCETAGYFFLYQV MSRWEEYISKVKNKGSTLPDVTEVSTFFPFHEYFANAPQPIFKGRSYEEDMEIAEGCFRH IKKIFTQLEEFRASELLRSGLDRSKYLLVKEAKIIAMTCTHAALKRHDLVKLGFKYDNIL MEEAAQILEIETFIPLLLQNPQDGFSRLKRWIMIGDHHQLPPVIKNMAFQKYSNMEQSLF TRFVRVGVPTVDLDAQGRARASLCNLYNWRYKNLGNLPHVQLLPEFSTANAGLLYDFQLI NVEDFQGVGESEPNPYFYQNLGEAEYVVALFMYMCLLGYPADKISILTTYNGQKHLIRDI INRRCGNNPLIGRPNKVTTVDRFQGQQNDYILLSLVRTRAVGHLRDVRRLVVAMSRARLG LYIFARVSLFQNCFELTPAFSQLTARPLHLHIIPTEPFPTTRKNGERPSHEVQIIKNMPQ MANFVYNMYMHLIQTTHHYHQTLLQLPPAMVEEGEEVQNQETELETEEEAMTVQADIIPS PTDTSCRQETPAFQTDTTPSETGATSTPEAIPALSETTPTVVGAVSAPAEANTPQDATSA PEETK >gi568815583r:34690415_34894808|GENSCAN_predicted_CDS_7|1998_bp gttgtgggcccacctggtacaggcaaaacagatgtggcagttcagatcatatccaacatc taccacaacttcccagaacagaggactctaattgttactcattccaatcaggccctaaac cagttgtttgagaaaatcatggcattagacattgatgagcgccacctactgcgtcttggt catggagaagaagagctggagacagagaaagatttcagcaggtatggaagagttaattat gttctggctcgaagaatagaacttttagaagaagtcaaacgattgcaaaagagtctaggg gttccaggagatgcctcatatacctgtgaaactgcaggctatttcttcttataccaggta atgtctcgctgggaagagtatatcagcaaagtgaaaaataaaggtagtacattgccagat gttacggaagtctccactttcttccctttccatgaatactttgcaaatgctcctcaaccc atttttaaaggaagatcttatgaagaagacatggaaattgctgaaggatgtttcaggcat attaagaaaatctttacgcagcttgaggaattcagagcctctgaattgcttcgaagtgga ctggacagatctaaataccttttagtgaaagaagccaaaattattgctatgacctgtact catgctgccttaaaacgacatgacttggtcaagctaggtttcaagtatgacaacattttg atggaagaggctgctcagattctggagatagaaacttttatccctcttcttctacagaat cctcaggatggatttagccgactaaaacgatggattatgattggcgatcatcaccagtta cctccagttattaagaacatggcctttcaaaagtactcaaacatggagcagtctctcttc actcgctttgttcgcgttggagttccgactgttgaccttgatgctcaagggagagccaga gcaagcttgtgcaacctctacaactggcgatacaagaatctaggaaacttaccccatgtg cagctcttgccagagtttagtacagcaaatgctggcttactgtatgacttccagctcatt aatgttgaagattttcaaggagtgggagaatctgaacctaatccttacttctatcagaat cttggagaggcagaatatgtagtagcactttttatgtacatgtgtttacttggttaccct gctgacaaaatcagtattctaacaacatataatggccaaaagcatcttattcgcgacatc atcaatagacgatgtggaaacaatccattgattggaagaccaaacaaggtgacaactgtt gatagatttcaaggtcaacagaatgactatattcttctttctctggtacgaaccagggca gtgggccatctgagggatgtccgtcgcttggtagtggccatgtctagagccagacttgga ctttatatcttcgccagagtatccctcttccaaaactgttttgaactgactccagctttc agtcagctcacagctcgcccccttcatttgcatataattccaacagaacctttcccaact actagaaagaatggagagagaccatctcatgaagtacaaataataaaaaatatgccccag atggcaaactttgtatacaacatgtacatgcatttgatacagactacacatcattatcat cagactttattacaactaccacctgctatggtagaagagggtgaggaagttcaaaatcaa gaaacagaattggaaacagaagaagaggccatgactgttcaagctgacatcatacccagt ccaacagacaccagctgccgtcaagaaactccagcctttcaaactgacaccacccccagt gagacaggagccacttccactccagaagccatccctgctttatctgagaccacccctact gtggtaggagctgtatctgcaccggcagaagctaacacacctcaggatgccacatctgcc ccggaagagaccaagtag