GENSCAN 1.0    Date run: 5-Nov-116    Time: 04:59:09

Sequence gi568815589r:19276296_19479620 : 203325 bp : 40.34% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10474  10726  253  1  1   77   95   252 0.998  21.31
 1.02 Intr +  12301  12370   70  0  1   82  119    42 0.867   4.54
 1.03 Intr +  14565  14581   17  1  2   89   95    24 0.465  -2.46
 1.04 Intr +  19713  19951  239  2  2   80   53   138 0.852   4.99
 1.05 Intr +  23892  24036  145  0  1   93   87    34 0.779   3.16
 1.06 Intr +  29057  29232  176  1  2   78   75    25 0.069  -1.98
 1.07 Intr +  40122  40222  101  0  2   59   95    40 0.215   0.63
 1.08 Intr +  40326  40544  219  1  0   70   96   238 0.908  20.15
 1.09 Intr +  48067  48212  146  2  2  107   93    99 0.984  11.38
 1.10 Intr +  49644  49679   36  2  0   91   95    48 0.909   3.34
 1.11 Intr +  58682  58810  129  1  0  113  115    27 0.986   7.87
 1.12 Intr +  59975  60119  145  1  1   92   90   111 0.999  10.63
 1.13 Intr +  60391  60537  147  2  0   68   79   140 0.930  10.39
 1.14 Intr +  64697  64819  123  0  0   84   37   148 0.997   8.94
 1.15 Intr +  66338  66484  147  0  0   53  105    99 0.989   7.39
 1.16 Intr +  69626  70791 1166  0  2  107   94   624 0.977  52.43
 1.17 Intr +  74407  74584  178  0  1   27   80   131 0.975   4.77
 1.18 Intr +  75778  75887  110  2  2   87   83    20 0.732   0.48
 1.19 Intr +  76195  76370  176  0  2  109   96    60 0.995   6.72
 1.20 Intr +  80677  80859  183  1  0   86   64   181 0.992  13.48
 1.21 Intr +  81733  81865  133  1  1   43   68   126 0.708   5.83
 1.22 Intr +  83949  84194  246  2  0   99   91   128 0.985  10.93
 1.23 Intr +  85575  85668   94  2  1   93   83    24 0.740   1.02
 1.24 Term +  92934  93061  128  1  2   55   49    91 0.662  -0.64
 1.25 PlyA +  93865  93870    6                               1.05

 2.07 PlyA -  94277  94272    6                               1.05
 2.06 Term - 100093  99998   96  1  0   84   44   145 0.997   6.69
 2.05 Intr - 100356 100199  158  1  2   69   96   214 0.983  19.11
 2.04 Intr - 102219 102073  147  1  0   47   97   176 0.995  13.69
 2.03 Intr - 102623 102413  211  2  1   99  100   188 0.997  18.66
 2.02 Intr - 103323 103192  132  0  0   87   93   255 0.723  25.82
 2.01 Init - 103900 103895    6  1  0   74  100    10 0.565   1.16
 2.00 Prom - 107921 107882   40                              -5.75

 3.00 Prom + 112652 112691   40                              -2.35
 3.01 Init + 118351 118563  213  0  0   65   48   114 0.147   3.89
 3.02 Intr + 124102 124171   70  0  1  118  101    37 0.308   5.84
 3.03 Term + 126425 126558  134  0  2   92   43    95 0.297   2.67
 3.04 PlyA + 127459 127464    6                               1.05

 4.02 PlyA - 127572 127567    6                               1.05
 4.01 Sngl - 133089 132685  405  0  0   63   48   331 0.919  22.63
 4.00 Prom - 147023 146984   40                              -6.35

 5.00 Prom + 148602 148641   40                              -4.45
 5.01 Init + 169996 170123  128  0  2   84   91   224 0.999  19.98
 5.02 Term + 174155 174341  187  2  1  131   45   149 0.812  10.88
 5.03 PlyA + 177279 177284    6                               1.05

 6.07 PlyA - 177604 177599    6                               1.05
 6.06 Term - 182359 182251  109  0  1  125   38    99 0.607   5.60
 6.05 Intr - 185062 184656  407  1  2   74   24   393 0.469  23.72
 6.04 Intr - 186538 186332  207  1  0   47   44   138 0.862   3.65
 6.03 Intr - 188198 188113   86  0  2   45   69    99 0.189   2.32
 6.02 Intr - 189515 189347  169  2  1   58   58    76 0.040   0.20
 6.01 Intr - 195774 195549  226  2  1   83   89   136 0.392  10.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 147567 147681  115  2  1  117  106    15 0.877   5.30
S.002 Term - 166167 166042  126  0  0  109   48    82 0.953   3.60

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:19276296_19479620|GENSCAN_predicted_peptide_1|1502_aa
XVLYEGKERLIPGCEVILATPYGRCANVNNSSTTSQRIFITYRRAPPVRPQNSLAVTDIC
VIVTSKGETPPHTFCKVDKNLNCGMWGSSVFLCYKKSVPASNAIAYKAGSSAKKVYGAAI
QFYEPYSRELLSEKQLMHLGLLTPVERKMVSKSINTNKCICLLSHWPFFEAFRKFLMFIY
KLSVSGPHPLPIENGANFSTLLMNLGPENCATLLLFVLLESKILLHSLRPAVLTGVAEAV
VAMIFPFQWQCPYIPLCPLSLAAVLSAPLPFIVGVDSRYFDLHDPPQDVVCIDLDTNMLY
VSDEKKNMNWKQLPKKPCKNLLSTLKKLYPQLSSVHQKTQEGSAIDMTPIEADFSWQKKM
TQLEMEIQEAFLRFMASILKGYRTYLRPITEAPSNKATAADSLFDRQGFLKSRDRAYAKF
YTLLSKTQIFIRFIEECSFVSDKDTGLAFFDDCIEKLFPDKGTEKTDKVCYRVVMQLCGL
WGHPVLAVRVLFEMKTARIKPNAITYGYYNKVVLESPWPSSTRSGIFLWTKVRNVVRGLA
QFRQPLKKTVQRSQVSSISALQNVTGGSDGDTVSHGSVDSSNDANNGEHTVFVRDLIRLE
SIDNHSSTGGQSDQGYGSKDELIKDDAEIHVPEEQAARELITKTKMQTEEVCDASAIVAK
HSQPSPEPHSPTEPPAWGSSIVKVPSGIFDVNSRKSSTGSISNVLFSTQDPVEDAVFGEA
TNLKKNGDRGEKRQKHFPERSCSFSSESRAGMLLKKSSLDSNSSEMAIMMGADAKILTAA
LTCPKTSLLHIARTHSFENVSCHLPDSRTCMSESTWNPEHRSSPVPEMLEESQELLEPVV
DDVPKTTATVDTYESLLSDSNSNQSRDLKTVSKDLRNKRSSLYGIAKVVQREDVETGLDP
LSLLATECTGGKTPDSEDKLFSPVIARNLADEIESYMNLKSPLGSKSSSMELHREENRES
GMTTAFIHALERRSSLPLDHGSPAQENPESEKSSPAVSRSKTFTGRFKQQTPSRTHKERS
TSLSALVRSSPHGSLGSVVNSLSGLKLDNILSGPKIDVLKSGMKQAATVASKMWVAVASA
YSYSDDEEETNRDYSFPAGLEDHILGENISPNTSISGLVPSELTQSNTSLGSSSSSGDVG
KLHYPTGEVPFPRGMKGQDFEKSDHGSSQNTSMSSIYQNCAMEVLMSSCSQCRACGALVY
DEEIMAGWTADDSNLNTACPFCKSNFLPLLNIEFKDLRGSASFFLKPSTSGDSLQSGSIP
LANESLEHKPVSSLAEPDLINFMDFPKHNQIITEETGSAVEPSLSKRNVSLTRSHSVGGP
LQNIDFTQRPFHGISTVSLPNSLQEVVDPLGKRPNPPPVSVPYLSPLVLRKELESLLENE
GDQVIHTSSFINQHPIIFWNLVWYFRRLDLPSNLPGLILTSEHCNEGVQDSKLVYIQLLW
DNINLHQEPREPLYVSWRNFRIQVPSVVPAFMAAGVQAITFQPQAVSHHILEEKGRVTGL
PV

>gi568815589r:19276296_19479620|GENSCAN_predicted_CDS_1|4509_bp
nnagttctatatgaagggaaagaacggcttattccaggatgtgaagtgatcctagccaca
ccctatggtcgctgtgccaatgtcaacaatagttcaactacttcacaaagaatctttatc
acttatcgaagggctcctccagttcgaccccagaattccttggctgtaactgatatctgt
gttattgtaaccagtaaaggagaaactcctcctcataccttctgcaaagttgacaaaaac
ttaaattgtggaatgtggggttccagcgtgtttctgtgttataagaagtctgtacctgct
tcaaatgcaatagcatataaggctggttcttcagccaaaaaggtatatggagctgccatt
cagttttatgaaccttactctcgggaacttctatcagagaaacagcttatgcacctgggc
ttgttgacgcctgtggagagaaaaatggtctccaaatccatcaatacaaacaaatgcatt
tgtttactctcacactggcctttttttgaagcttttaggaaatttcttatgtttatctac
aaactttctgtgtctggaccacatcctcttcccattgaaaatggagccaactttagcacc
ttgctaatgaatctgggtcctgagaattgtgcaacactgctgctctttgttttacttgag
agtaaaattctgctgcattctcttaggccagctgtcttgactggggtagctgaagctgtt
gtagctatgatctttccatttcagtggcaatgcccatatattcccctttgtcctctttca
ctggctgcagtgcttagtgcacctttaccatttatagttggagttgactcaaggtatttt
gatcttcatgacccaccacaagatgttgtttgcattgacttggatacgaacatgttatat
gtatcagatgaaaagaagaacatgaactggaagcaacttcccaaaaagccgtgcaaaaat
ctacttagcaccttaaagaaattgtatccccagctgtcttcagttcaccaaaaaactcaa
gaaggctcagcgattgacatgactccaattgaagcagatttctcctggcaaaagaagatg
acacagcttgagatggaaattcaagaggcatttttgcgctttatggcgtctattttaaaa
ggatatagaacatatctcagaccaatcacagaggctccttcaaataaagccacagctgct
gattcattgtttgaccgacagggatttttaaaaagtcgagatcgtgcctatgcaaaattc
tatacccttttatccaaaacacagatttttattcgtttcattgaagaatgcagttttgta
agtgataaagatactggattagcattttttgatgactgcatagaaaagttgtttcctgat
aaaggcacagagaaaacagataaggtgtgctatcgagtagtgatgcagctttgtggactt
tggggtcatcctgttttagcagtgagagtcttatttgaaatgaaaactgctaggataaag
cctaatgctattacttatggttattataataaggtagtcttggagagcccgtggcctagc
agtacccgcagtggtattttcttatggacgaaggtacggaatgtggtacgtggcttggca
cagtttaggcagccgcttaaaaagactgtgcaaaggtcacaggtctcctcaatatcagct
cttcaaaatgtcacaggtggaagtgatggggacacggtgagccacggtagtgtggatagt
tctaatgatgctaacaatggggagcacacagtcttcgtcagagatttaatcaggcttgag
tccattgataatcactctagcacaggtggtcagtctgaccaaggatacgggtctaaggat
gaacttataaaggatgatgcagaaattcatgtgcctgaagaacaggcagcaagagaattg
ataactaaaacaaaaatgcaaacagaagaggtgtgtgatgcctctgctattgtggcaaaa
cattcacaacctagtccagagcctcacagtcctactgaacctcctgcatggggcagcagt
attgtgaaagttccgtctggtatatttgatgtcaacagcaggaaaagtagcactggtagt
atatcaaatgtgctgttttctactcaagatccagttgaagatgcagtctttggcgaagct
actaatctcaagaagaatggtgatagaggagaaaaaagacaaaagcattttcctgagagg
agttgtagttttagttctgaaagtcgagcaggaatgttgcttaagaagagtagtttggat
tcgaattcaagtgaaatggctatcatgatgggagcagatgccaagattctcacagcagca
ttgacatgtcctaagacttctctacttcatattgcaagaacccatagctttgagaatgtt
agctgtcacctacctgatagtaggacttgtatgtctgaaagcacttggaatcctgagcac
agatcatctccggtgccagagatgcttgaggaaagccaagaactccttgagcctgtggtt
gatgacgtacctaaaactactgcaacagtagatacatatgagagtctactaagtgatagt
aacagtaatcagtccagagacttgaaaacagtatccaaagatctgaggaataagagaagt
agtttatatggtattgctaaggtggttcagagggaagatgttgaaactggactagatcct
ttgtctcttttagccactgaatgtacaggaggaaaaactcctgattctgaagataagttg
ttttctccagttattgcacgtaatctggctgatgaaatagaaagctatatgaacctaaaa
agtcccctaggtagtaaatcttctagtatggaattacacagagaggaaaacagagagtct
ggcatgactactgcatttattcatgctctagagaggagatcaagcctacctttagatcat
ggttcaccagcacaggaaaatcctgaaagtgaaaagagctcacctgcagtgtccaggtct
aaaacttttactgggcgtttcaagcagcaaaccccctctcgaactcataaagaacgttca
acttctttgtcagcactggtgcgttcttcgccacatggctcgttgggttctgtagtaaat
tctttgtcagggctaaagctggataatatactctcagggcccaagatagatgtcctgaaa
tctggtatgaaacaagcagcgacagtagccagtaagatgtgggtagctgttgcgtctgcc
tacagctactcagatgatgaggaagaaactaatagagactacagcttcccagctggccta
gaagaccatattttgggggagaatatatcgcctaacacaagtatctcagggttggtcccc
agtgaacttacccagagcaacacaagtcttggcagtagcagcagtagtggagatgtagga
aaactgcattatccaacaggtgaagttccatttccaagaggcatgaaagggcaagacttt
gaaaaatcagatcatggttcttctcaaaataccagcatgtctagcatctatcagaattgt
gcaatggaggttttgatgtccagttgttcacagtgtagagcttgtggagctttagtttat
gatgaagaaattatggctggatggacagcagatgactcaaatttgaatacagcttgtcca
ttctgtaaaagcaacttcttgcctcttctcaatatagaattcaaagatttgagaggttct
gcaagctttttcctgaaaccaagtacctctggtgacagtttacaaagtggaagcattcca
ttggcaaatgaatccttggagcacaaacctgtatccagtttagcagaacctgacttgatc
aactttatggacttcccaaaacataaccagatcataactgaagaaacaggctctgcagtt
gaaccaagtttatcaaagcgaaatgtgtctttgactcgaagtcacagtgttggaggccca
ttgcagaatattgactttacccagcgaccgtttcatggcatctcaacagttagtcttcca
aatagtctgcaggaagttgtggatcctttaggaaaaagacccaatcctccccctgtttct
gtgccctacttgagtcctctagtactccgtaaagaacttgaatctttgctagaaaatgaa
ggtgatcaggtgattcatacatcttctttcatcaatcaacatccaatcattttctggaac
ctcgtttggtatttcagacgtttggaccttcctagtaacttgccaggacttatcctcaca
tctgaacattgtaatgaaggtgtacaggatagcaaacttgtgtatattcagctgttatgg
gataatatcaaccttcatcaggaaccaagagaacctctgtatgtctcatggaggaatttt
aggatccaggttccttctgttgtacctgcttttatggctgctggagtacaagctatcaca
ttccagcctcaggcagtaagccaccatatattagaagagaaaggcagagtgacaggtctg
cctgtttga

>gi568815589r:19276296_19479620|GENSCAN_predicted_peptide_2|249_aa
MKLNISFPATGCQKLIEVDDERKLRTFYEKRMATEVAADALGEEWKGYVVRISGGNDKQG
FPMKQGVLTHGRVRLLLSKGHSCYRPRRTGERKRKSVRGCIVDANLSVLNLVIVKKGEKD
IPGLTDTTVPRRLGPKRASRIRKLFNLSKEDDVRQYVVRKPLNKEGKKPRTKAPKIQRLV
TPRVLQHKRRRIALKKQRTKKNKEEAAEYAKLLAKRMKEAKEKRQEQIAKRRRLSSLRAS
TSKSESSQK

>gi568815589r:19276296_19479620|GENSCAN_predicted_CDS_2|750_bp
atgaagctgaacatctccttcccagccactggctgccagaaactcattgaagtggacgat
gaacgcaaacttcgtactttctatgagaagcgtatggccacagaagttgctgctgacgct
ctgggtgaagaatggaagggttatgtggtccgaatcagtggtgggaacgacaaacaaggt
ttccccatgaagcagggtgtcttgacccatggccgtgtccgcctgctactgagtaagggg
cattcctgttacagaccaaggagaactggagaaagaaagagaaaatcagttcgtggttgc
attgtggatgcaaatctgagcgttctcaacttggttattgtaaaaaaaggagagaaggat
attcctggactgactgatactacagtgcctcgccgcctgggccccaaaagagctagcaga
atccgcaaacttttcaatctctctaaagaagatgatgtccgccagtatgttgtaagaaag
cccttaaataaagaaggtaagaaacctaggaccaaagcacccaagattcagcgtcttgtt
actccacgtgtcctgcagcacaaacggcggcgtattgctctgaagaagcagcgtaccaag
aaaaataaagaagaggctgcagaatatgctaaacttttggccaagagaatgaaggaggct
aaggagaagcgccaggaacaaattgcgaagagacgcagactttcctctctgcgagcttct
acttctaagtctgaatccagtcagaaataa

>gi568815589r:19276296_19479620|GENSCAN_predicted_peptide_3|138_aa
MSKRYAENFTEGVMQKATKHVQRYSTSLPIGETYIKTTMRYYYTTIRMAEIKNSDNTKCG
DVDKLGNHTLLVSLTAPAPCQPGPVSPHNSGFCKLLCLVKSCSSFNFSSEATSSAKRPMT
LSADLGPCPSASTYAPPF

>gi568815589r:19276296_19479620|GENSCAN_predicted_CDS_3|417_bp
atgagcaaaagatatgcagagaatttcactgaaggagttatgcagaaggctactaagcac
gtgcaaagatattcaacatcattacctattggggaaacgtatattaaaaccacaatgaga
tattattacacaactatcagaatggctgagataaaaaacagtgacaacaccaaatgtggg
gatgtggataaactaggaaatcatacgttgctggtctctctgactgctcctgctccttgt
cagccaggacctgtgtctcctcataactcaggcttctgtaaattgctctgcctggtaaaa
tcctgttcatcttttaacttcagttccgaggctacctcctctgcaaaacgtcccatgact
ctcagtgcagacttgggcccctgcccttctgcttccacatatgcacctccattttag

>gi568815589r:19276296_19479620|GENSCAN_predicted_peptide_4|134_aa
MGTGEEDRWRTERSAPRRGCGLTQPLERMPPDARRDPRPASRRCGAGTAPSRWPACLPRS
RAPHRVVELGDSRHDGVVVLAPVHLRATSLQLVPPVRGAHGHSGALEQLLAQLRRAQSCC
GGKTRRQPAAKAAL

>gi568815589r:19276296_19479620|GENSCAN_predicted_CDS_4|405_bp
atggggacaggagaggaggacagatggagaactgaaaggagcgcgccgaggagggggtgt
ggactgactcagcccctggagagaatgcccccagatgcgcgcagagacccacgtccagct
tccagacgctgcggcgcgggaacagccccctcccgctggcccgcctgccttccccgctcc
cgcgccccgcaccgtgttgtagaactcggcgatagcaggcacgatggtgtagttgtcctc
gcaccagtccacctccgagctaccagcctgcagctggtcccaccagtgcggggcgcccat
ggccactccggggcattggagcagctgctcgcgcagctgagaagagcccagagctgctgc
ggaggcaaaacgcggcgacagccagcggcgaaagcggctttatga

>gi568815589r:19276296_19479620|GENSCAN_predicted_peptide_5|104_aa
MRVFKLGLFSGLWWTLALFCWISDRAFCELLSSFNFPYLHCMWHILICLAAYLGCVCFAY
FDAASEIPEQGPVIKFWPNEKWAFIGVPYVSLLCANKKSSVKIT

>gi568815589r:19276296_19479620|GENSCAN_predicted_CDS_5|315_bp
atgcgtgtgtttaagctgggcctcttctcgggcctctggtggaccctggccctgttctgc
tggatcagtgaccgagctttctgcgagctgctgtcatccttcaacttcccctacctgcac
tgcatgtggcacatcctcatctgccttgctgcctacctgggctgtgtatgctttgcctac
tttgatgctgcctcagagattcctgagcaaggccctgtcatcaagttctggcccaatgag
aaatgggccttcattggtgtcccctatgtgtccctcctgtgtgccaacaagaaatcatca
gtcaagatcacgtga

>gi568815589r:19276296_19479620|GENSCAN_predicted_peptide_6|401_aa
XLQVCPQTCQLVWPPSFWWAAGPLVTPSDQGGSSSREREFLIFSPGDESNDTARPRTKEV
PREGVCEIFPGGRSPSLAGCSRGHRRKGGPWPFPRLNPGSSPPESACDPFNLVGRNLREE
AVQVTIQPLWKRRAVDRRAIICPAVWQPGDLSPGLVLAARQPWRQAQRPCERPGRTHKAL
RDSSEGDKGEMLKTRKTGKEGRDQIVEETSDHTQGLQSGLFCSKSNGEELEPRGPSRRRR
RRLDPRTRPAEKTFKQRRTFQQRVDVRLIQEQHPAKSPVITERYQGEKQLPVLDKTKFLV
PDVRDVNMSELIKIIRRRLQLNANQAFLLVVNGHSMVSVSTPVSEVYESEKDEDGFLYMV
YASQETTVDSSPLRGFGRVSGLLPSPDLSCSIRPHNPLQAA

>gi568815589r:19276296_19479620|GENSCAN_predicted_CDS_6|1206_bp
nncctccaggtttgtccccagacctgccaactggtgtggccgccttccttttggtgggct
gcaggaccactggtgacacctagtgaccaaggcggcagtagcagccgagaacgagaattc
ctgattttctctcctggtgatgagagtaatgacactgcccgccccaggaccaaggaagtc
cccagggagggagtgtgtgagatttttccaggagggagatccccctcgctcgcaggctgc
agccgaggccacaggaggaaaggaggcccctggccctttcctcggttgaacccaggctct
tcacccccggagtcggcctgtgaccctttcaatttggtgggtagaaatctccgggaagag
gctgtacaagtcacaatacaacccctctggaaaagacgtgccgtggacagaagagccatc
atctgcccagcggtctggcagcctggagacctttcccctggcctagtgctcgctgcccgt
cagccatggagacaggcccaaaggccatgtgaaagaccaggaagaacacacaaagctctg
agagactcgagtgaaggggacaagggggagatgctcaaaaccagaaagactggaaaggaa
ggcagggaccagatcgtggaggagacttcagatcacactcagggtctccaatcaggtttg
ttttgttctaagagcaatggggaggaactggagccgcggggaccctcgcgccgtcgccgc
cgccgcctggatccccgcaccaggccagcggagaagaccttcaagcagcgccgcaccttc
caacaaagagtagatgtccgacttattcaagagcagcatccagccaaaagcccggtgata
acagaacgataccaaggtgagaagcagcttcctgtcctggataaaacgaagttccttgta
cctgacgttcgagacgtcaacatgagtgagctcatcaagataattagaaggcgcttacag
ctcaatgctaatcaagccttcctcctggtggtgaacggacatagcatggtgagcgtctcc
acaccagtctcagaggtgtatgagagtgagaaggatgaagatggattcctatacatggtc
tatgcctcccaggagaccactgttgattcctctccactacggggatttggcagagtcagt
ggccttctcccctccccagatctgagctgctccatccgtccccataaccctctgcaggca
gcataa