GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:46:59 Sequence gi568815581r:40454745_40658943 : 204199 bp : 46.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1705 1835 131 0 2 111 42 139 0.309 11.09 1.02 Intr + 5333 5476 144 2 0 103 44 62 0.267 2.70 1.03 Intr + 5866 5947 82 1 1 129 52 39 0.161 4.04 1.04 Intr + 7610 7683 74 1 2 101 36 42 0.264 -1.50 1.05 Intr + 12864 13005 142 0 1 110 49 64 0.115 5.06 1.06 Term + 15288 15425 138 2 0 -12 46 125 0.050 -3.74 1.07 PlyA + 17301 17306 6 1.05 2.12 PlyA - 21113 21108 6 1.05 2.11 Term - 22985 22844 142 1 1 93 50 178 0.691 11.90 2.10 Intr - 23589 23563 27 2 0 120 95 26 0.930 3.73 2.09 Intr - 23904 23836 69 2 0 103 100 81 0.997 9.10 2.08 Intr - 25098 24930 169 1 1 88 72 238 0.985 21.20 2.07 Intr - 27462 27385 78 1 0 100 90 77 0.980 8.62 2.06 Intr - 27672 27580 93 1 0 101 77 180 0.553 18.14 2.05 Intr - 29913 29740 174 1 0 7 85 133 0.757 4.81 2.04 Intr - 30263 30177 87 0 0 121 93 102 0.998 13.94 2.03 Intr - 32542 32292 251 0 2 27 83 202 0.571 10.68 2.02 Intr - 34225 33802 424 2 1 92 100 159 0.768 10.42 2.01 Init - 41681 41243 439 2 1 92 82 357 0.994 29.78 2.00 Prom - 42182 42143 40 -8.36 3.00 Prom + 44921 44960 40 -5.26 3.01 Init + 48045 48104 60 2 0 67 80 51 0.528 1.51 3.02 Intr + 51676 51780 105 0 0 108 77 43 0.890 5.61 3.03 Intr + 55402 55653 252 0 0 48 105 96 0.854 4.93 3.04 Intr + 57399 57621 223 2 1 110 47 58 0.713 1.60 3.05 Intr + 59335 59418 84 2 0 32 83 81 0.558 1.79 3.06 Intr + 64751 64776 26 0 2 85 90 25 0.128 0.14 3.07 Intr + 68770 68911 142 0 1 -49 101 130 0.096 0.53 3.08 Term + 76938 77131 194 1 2 26 50 178 0.420 5.38 3.09 PlyA + 80799 80804 6 1.05 4.03 PlyA - 81040 81035 6 1.05 4.02 Term - 87099 86894 206 1 2 116 43 80 0.536 3.83 4.01 Init - 88236 88227 10 0 1 59 116 0 0.452 0.58 4.00 Prom - 92106 92067 40 -4.16 5.05 PlyA - 92764 92759 6 1.05 5.04 Term - 101074 99998 1077 1 0 85 48 1745 0.994 162.32 5.03 Intr - 104420 104149 272 0 2 110 94 76 0.546 7.66 5.02 Intr - 104733 104577 157 0 1 66 49 70 0.525 0.48 5.01 Init - 107964 107761 204 0 0 73 59 108 0.568 5.25 5.00 Prom - 109702 109663 40 -4.36 6.00 Prom + 109806 109845 40 -3.96 6.01 Sngl + 115201 115533 333 0 0 48 44 225 0.460 10.12 6.02 PlyA + 115543 115548 6 1.05 7.00 Prom + 118875 118914 40 -1.86 7.01 Init + 157630 157687 58 0 1 68 86 39 0.298 3.17 7.02 Term + 164443 164528 86 2 2 99 49 81 0.161 3.12 7.03 PlyA + 168576 168581 6 1.05 8.11 PlyA - 168637 168632 6 1.05 8.10 Term - 174324 174041 284 1 2 27 42 246 0.829 9.49 8.09 Intr - 176180 175970 211 2 1 11 91 302 0.977 21.29 8.08 Intr - 176949 176848 102 0 0 75 32 126 0.964 6.07 8.07 Intr - 177623 177451 173 0 2 55 103 128 0.870 10.66 8.06 Intr - 181358 181187 172 2 1 25 53 150 0.876 4.72 8.05 Intr - 181782 181651 132 0 0 99 97 115 0.999 14.34 8.04 Intr - 182828 182748 81 2 0 75 100 58 0.949 5.53 8.03 Intr - 187815 187711 105 0 0 69 70 40 0.136 0.71 8.02 Intr - 201886 201724 163 0 1 82 -2 110 0.222 1.38 8.01 Intr - 202743 202608 136 1 1 47 89 51 0.508 0.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_1|236_aa CHPALDGQRGKCWCVDRKTGVKLPGGLEPKGELDCHQLADSFREKRLVVLTPLYPEPHSA TKLLQECYEACRIWDACSVLFPSVGGCYVAVWSPAPFPSTWTIPAKLGEALVEPGSSNSS SAQLQIVKWGLTMPDPWSPGVALRVFAHAGPSAFNAPNAPSSTPPDNGFFFSSHLSHHHL LREALPDFPEQGIGFDDLGLNKLQMSAFISITIVIIIISSHGIRLPGFRSHSATYG >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_1|711_bp tgtcacccagctctggatgggcagcgtggcaagtgctggtgtgtggaccggaagacgggg gtgaagcttccggggggcctggagccaaagggggagctggactgccaccagctggctgac agctttcgagagaaaaggctggttgtcctaaccccgctgtaccctgagccacattctgcc accaagctgctccaggaatgttacgaggcatgcaggatttgggatgcctgctcggtcctc tttcccagtgtggggggctgttatgtggctgtgtggtccccagcccccttcccttctacc tggacaattccagccaaactgggggaggcgctggtggagcctgggagcagcaactcgagc tcagctcagctgcagattgtcaagtgggggctgaccatgcctgacccctggtctcctgga gtagccttaagggtctttgcacatgctggtccctctgctttcaatgctcccaatgccccc tcctccacaccccctgacaatggatttttcttcagctctcatctcagccatcatcacctc ctcagggaagccctccctgacttccctgagcagggtattggatttgatgaccttgggctc aataaactccaaatgtctgctttcatctccatcaccatcgtcatcattatcatcagcagc catgggatcagattgcctgggttcagatcccactctgccacttatggttga >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_2|650_aa MSQVMSSPLLAGGHAVSLAPCDEPRRTLHPAPSPSLPPQCSYYTTEGWGAQALMAPVPCM GPPGRLQQAPQVEAKATCFLPSPGEKALGTPEDLDSYIDFSLESLNQMILELDPTFQLLP PGTGGSQAELAQSTMSMRKKEESEALDIKYIEVTSARSRCHDGPQHCSSPSVTPPFGSLR SGGLLLSRDVPRETRSSSESLIFSGNQGRGHQRPLPPSEGLSPRPPNSPSISIPCMGSKA SSPHGLGSPLVASPRLEKRLGGLAPQRGSRISVLSASPVSDVSYMFGRTPHSPPLAKEHA SSCPPSITNSMVDIPIVLINGCPEPGSSPPQRTPGHQNSVQPGAASPSNPCPATRSNSQT LSDAPFTTCPEGPARDMQPTMKFVMDTSKYWFKPNITREQGCPGGAVSISDSRIVPAIEL LRKEEPGAFVIRDSSSYRGSFGLALKVQEVPASAQSRPGEDSNDLIRHFLIESSAKGVHL KGADEEPYFGSLSAFVCQHSIMALALPCKLTIPQRGCHTLYLSSVSVETLTGALAVQKAI STTFERDILPTPTVVHFKVTEQGITLTDVQRKVFFRRHYPLTTLRFCGMDPEQRKWQKYC KPSWIFGFVAKSQTEPQENVCHLFAEYDMVQPASQVIGLVTALLQDAERM >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_2|1953_bp atgtcccaggtgatgtccagcccactgctggcaggaggccatgctgtcagcttggcgcct tgtgatgagcccaggaggaccctgcacccagcacccagccccagcctgccaccccagtgt tcttactacaccacggaaggctggggagcccaggccctgatggcccccgtgccctgcatg gggccccctggccgactccagcaagccccacaggtggaggccaaagccacctgcttcctg ccgtcccctggtgagaaggccttggggaccccagaggaccttgactcctacattgacttc tcactggagagcctcaatcagatgatcctggaactggaccccaccttccagctgcttccc ccagggactgggggctcccaggctgagctggcccagagcaccatgtcaatgagaaagaag gaggaatctgaagccttggacataaagtacatcgaggtgacctccgccagatcaaggtgc cacgatggcccccagcactgctccagcccctctgtcaccccgcccttcggctcccttcgc agtggtggcctcctcctttccagagacgtcccccgagagacacgaagcagcagtgagagc ctcatcttctctgggaaccagggcagggggcaccagcgccctctgcccccctcagagggt ctctcccctcgacccccaaattcccccagcatctcaatcccttgcatggggagcaaggcc tcgagcccccatggtttgggctccccgctggtggcttctccaagactggagaagcggctg ggaggcctggccccacagcggggcagcaggatctctgtgctgtcagccagcccagtgtct gatgtcagctatatgtttggaagaacaccccactctccaccactggccaaagaacatgcc agcagctgccccccatccatcaccaactccatggtggacatacccattgtgctgatcaac ggctgcccagaaccagggtcttctccaccccagcggaccccaggacaccagaactccgtt caacctggagctgcttctcccagcaacccctgtccagccaccaggagcaacagccagacc ctgtcagatgccccctttaccacatgcccagagggtcccgccagggacatgcagcccacc atgaagttcgtgatggacacatctaaatactggtttaagccaaacatcacccgagagcaa gggtgtccaggtggggcggtgtccatctctgactcaaggattgttccagcaatcgagctg ctgaggaaggaggagccaggggcttttgtcataagggacagctcttcataccgaggctcc ttcggcctggccctgaaggtgcaggaggttcccgcgtctgctcagagtcgaccaggtgag gacagcaatgacctcatccgacacttcctcatcgagtcgtctgccaaaggagtgcatctc aaaggagcagatgaggagccctactttgggagcctctctgccttcgtgtgccagcattcc atcatggccctggccctgccctgcaaactcaccatcccacagagaggctgccacaccctg tacctgagctcagtgagcgtggagaccctgactggagccctggccgtgcagaaagccatc tccaccacctttgagagggacatcctccccacgcccaccgtggtccacttcaaagtcaca gagcagggcatcactctgactgatgtccagaggaaggtgtttttccggcgccattaccca ctcaccaccctccgcttctgtggtatggaccctgagcaacggaagtggcagaagtactgc aaaccctcctggatctttgggtttgtggccaagagccagacagagcctcaggagaacgta tgccacctctttgcggagtatgacatggtccagccagcctcgcaggtcatcggcctggtg actgctctgctgcaggacgcagaaaggatgtag >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_3|361_aa MAASPMRLWLMIVGEWEGRLVTDENGWGQGIDWPEQWEFSVVGSGDTRGAFKQGKLDLKS KSAICAQKQIGGGLGTAVLGLALLDLSLSLEEGKCRAALGGIGTCAAMTQAAGPSVQTLA RRRASPASSCAILHGPRPKTPRSSVPGTELRNSAEETTGISFEEEFGPWSPLPTHWWEGA KTSVAAATPANPKPQECNGVTWGQREMDGQRVGRGAHSLVEKADLQFNDGYFMMSLLCRG GDPQQNCTERKKEQLLFRKVGEDFVKVGMFDLNLEEQAETQEVQKRGGGNSMNEVGHDFF LMCSDEAAAMLEAAPWNGQRAEEQRAQAVKGIGALNPVVHEELNPASNHSSKLGIGSPIE P >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_3|1086_bp atggcggcttcccccatgaggctgtggctcatgattgtgggagagtgggaggggcgactc gtgactgatgagaatgggtggggccagggtatagactggcctgagcagtgggaattttct gtggtgggcagtggagatacacggggagcctttaagcagggcaagttagatctgaaaagc aaatcagccatctgtgcccagaaacagataggcggggggctgggcactgccgtccttggg cttgccctcctggacctctcgctgtccttggaggaggggaagtgcagggctgctcttgga ggcatcggtacctgtgcagcaatgacccaagctgctggaccgtctgtccagacgctggcc aggagaagggcatccccggccagctcgtgtgcgatcctgcatggccctaggcccaagacg cctcggagttctgtgccaggaaccgagctgaggaactcagctgaggaaaccacgggaatc tcatttgaagaggaatttggcccctggagccccctccccactcactggtgggagggtgcc aagacttcagtcgccgccgccacaccagcaaaccccaaaccacaggaatgcaatggggtc acctggggacagagggagatggatggacagcgagtaggcagaggagctcacagtctagtg gagaaggcagacctgcagtttaacgatggctatttcatgatgtccttgctctgccgagga ggagacccgcagcagaactgcacagagaggaagaaagagcagttactcttccggaaggtt ggagaagattttgtgaaggtggggatgtttgatttgaaccttgaagaacaagcagagacc caagaagtacagaagaggggaggtggaaatagcatgaacgaggtggggcatgacttcttt cttatgtgttctgatgaagcagctgccatgttggaggctgccccatggaacggccaacgt gctgaggaacagagggcacaggcagtgaagggcatcggggccttgaatccagtggtccac gaggaactgaatcctgccagcaaccactcgagtaagcttggtattgggtccccaattgag ccttga >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_4|71_aa MAKGHCGRGEDSGVKVRLPEPGCCYYYPWGGPARTSHPSTVSRATRQSDCLSSATVPGLF NSGYQDRQPRP >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_4|216_bp atggctaaagggcactgtgggcgaggtgaggactccggtgtgaaagtgcgactgccagag cctggctgttgttactactacccatgggggggccctgcccgcacctcccacccctccacc gtgtcccgagccacaaggcagagtgactgcctctcttcagccacagtccctgggttattc aacagcggctaccaggacaggcagccaaggccctaa >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_5|569_aa MRRKDAGTGDPILKTWYGIPKEFELYPENNGDPSKIFIFQVGELKDEICVLEGVLWQQRG SKARVRTEETLRLGFMHVWLRTHLDARKMLLGQWLPAPKQGWKRVHLPHDLIALPPHPTS GPGSGGGQEGGEDSDAGQQARCSKAQSGSDVLFQQTQCLELGRKVPNSVSPSTAFFNNKD LSLPSNNFLALSPTGKPMKSVLVVALLVIFQVCLCQDEVTDDYIGDNTTVDYTLFESLCS KKDVRNFKAWFLPIMYSIICFVGLLGNGLVVLTYIYFKRLKTMTDTYLLNLAVADILFLL TLPFWAYSAAKSWVFGVHFCKLIFAIYKMSFFSGMLLLLCISIDRYVAIVQAVSAHRHRA RVLLISKLSCVGIWILATVLSIPELLYSDLQRSSSEQAMRCSLITEHVEAFITIQVAQMV IGFLVPLLAMSFCYLVIIRTLLQARNFERNKAIKVIIAVVVVFIVFQLPYNGVVLAQTVA NFNITSSTCELSKQLNIAYDVTYSLACVRCCVNPFLYAFIGVKFRNDLFKLFKDLGCLSQ EQLRQWSSCRHIRRSSMSVEAETTTTFSP >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_5|1710_bp atgaggaggaaggacgcaggcacaggagacccaatccttaaaacgtggtacggcatccct aaggaatttgaactttatcctgaaaacaatggggacccatcaaagatttttatttttcaa gtaggtgaattaaaggatgagatttgtgttttggaaggagttctgtggcagcaaagaggc agcaaagcccgtgtcagaacagaggagacactcaggcttggcttcatgcacgtgtggctg cgcactcacctggatgccaggaaaatgctcctgggccagtggctaccagcaccaaagcag ggctggaaacgtgtgcacctaccccacgacctcatagctttgccgcctcatcccacatca ggacctgggagtggaggaggacaagaaggaggtgaggacagtgatgccgggcagcaggcc aggtgttcaaaggcacaatctggttctgatgttctctttcagcaaacacagtgcctggag cttgggaggaaagttcccaacagcgtctccccctccactgctttctttaataacaaagac ttgtccctgccaagcaataactttctcgccttgtctcctacagggaaaccaatgaaaagc gtgctggtggtggctctccttgtcattttccaggtatgcctgtgtcaagatgaggtcacg gacgattacatcggagacaacaccacagtggactacactttgttcgagtctttgtgctcc aagaaggacgtgcggaactttaaagcctggttcctccctatcatgtactccatcatttgt ttcgtgggcctactgggcaatgggctggtcgtgttgacctatatctatttcaagaggctc aagaccatgaccgatacctacctgctcaacctggcggtggcagacatcctcttcctcctg acccttcccttctgggcctacagcgcggccaagtcctgggtcttcggtgtccacttttgc aagctcatctttgccatctacaagatgagcttcttcagtggcatgctcctacttctttgc atcagcattgaccgctacgtggccatcgtccaggctgtctcagctcaccgccaccgtgcc cgcgtccttctcatcagcaagctgtcctgtgtgggcatctggatactagccacagtgctc tccatcccagagctcctgtacagtgacctccagaggagcagcagtgagcaagcgatgcga tgctctctcatcacagagcatgtggaggcctttatcaccatccaggtggcccagatggtg atcggctttctggtccccctgctggccatgagcttctgttaccttgtcatcatccgcacc ctgctccaggcacgcaactttgagcgcaacaaggccatcaaggtgatcatcgctgtggtc gtggtcttcatagtcttccagctgccctacaatggggtggtcctggcccagacggtggcc aacttcaacatcaccagtagcacctgtgagctcagtaagcaactcaacatcgcctacgac gtcacctacagcctggcctgcgtccgctgctgcgtcaaccctttcttgtacgccttcatc ggcgtcaagttccgcaacgatctcttcaagctcttcaaggacctgggctgcctcagccag gagcagctccggcagtggtcttcctgtcggcacatccggcgctcctccatgagtgtggag gccgagaccaccaccaccttctccccatag >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_6|110_aa MKEVDFRMWIKMNVAELKKQVVTQCKEAKNHDKTMQELTAKIDIIEKNITDLIELKNTLQ ELHNAIRSIHRRIGQVLLSLPLKDSSSERISELEDCLYEIIRADKNREKE >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_6|333_bp atgaaagaagtagacttcagaatgtggataaaaatgaacgtcgctgagctaaagaagcaa gttgtaacccaatgcaaggaagctaaaaatcatgataaaacaatgcaggagctgacagcc aaaatagatattatagagaagaacataactgacctgatcgagctgaaaaacacactacaa gaacttcacaatgcaatcagaagtattcatagaagaataggccaagttctgcttagtctt cctctgaaagattcttcctctgaaagaatctcagagcttgaagactgtctttatgaaata atacgggcagataagaatagagaaaaagaatga >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_7|47_aa MIATFRSHQPLNFIEDAGKARPLARPLPNGSGCVQVPAERRSYLSAV >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_7|144_bp atgattgctactttcaggagccaccagcctctcaactttatcgaggatgcaggaaaagcc cggcccttggcccgacccctgccaaatggctccggctgtgttcaggtgcctgcggaaagg cgctcctacctgtcagcagtgtaa >gi568815581r:40454745_40658943|GENSCAN_predicted_peptide_8|519_aa XIINEISFTTKVPQKYENENVETVTKQAILNGSIVKESTEAHGTIQTEKVDEVIKEWEGS FFKDNPRLRKKSVSLRFDLHLAATDEGCLETKQDNLPDIEQMPSTPGFVGYNPYSHLAYN NYRLGGNPGTNSRVTASSGITIPKPPKPPDKPLMPYMRYSRKVWDQVKASNPDLKLWEIG KIIGGMWRDLTDEEKQEYLNEYEAEKIEYNESMKAYHNSPAYLAYINAKSRAEAALEEES RQRQSRMEKGEPYMSIQPAEDPDDYDDGFSMKHTATARFQRNHRLISEILSESVVPDVRS VVTTARMQVLKRQVQSLMVHQRKLEAELLQIEERHQEKKRKFLESTDSFNNELKRLCGLK VEVDMEKIAAEIAQAEEQARKRQEEREKEAAEQAERSQSSIVPEEEQAANKGEEKKDDEN IPMETAQLYTVSITRVKFHLGTDSDKCVITEETHLEETTESQQNGEEGTSTPEDKESGQE GVDSMAEEGTSDSNTGSESNSATVEEPPTDPIPEDEKKE >gi568815581r:40454745_40658943|GENSCAN_predicted_CDS_8|1560_bp nccattataaatgaaatatcttttactacaaaagtcccacaaaagtatgagaatgaaaat gtagaaacagtaaccaaacaggcaatcttaaatgggagtatcgttaaggagagcactgaa gctcatggcactattcagacagagaaagtggatgaagttattaaagaatgggaaggttct ttctttaaagataaccctcgattgaggaaaaagtctgtttctcttcgatttgatcttcat ttagcagccactgatgaagggtgtttagagactaagcaggataatctaccagatatagaa caaatgcccagcacaccagggtttgtgggatacaatccatacagtcatctcgcctacaac aactacaggctgggagggaacccgggcaccaacagccgggtcacggcatcctctggtatc acgattccaaaacccccaaagccaccagataagccgctgatgccctacatgaggtacagc agaaaggtctgggaccaagtaaaggcttccaaccctgacctaaagttgtgggagattggc aagattattggtggcatgtggcgagatctcactgatgaagaaaaacaagaatatttaaac gaatacgaagcagaaaagatagagtacaatgaatctatgaaggcctatcataattccccc gcgtaccttgcttacataaatgcaaaaagtcgtgcagaagctgctttagaggaagaaagt cgacagagacaatctcgcatggagaaaggagaaccgtacatgagcattcagcctgctgaa gatccagatgattatgatgatggcttttcaatgaagcatacagccaccgcccgtttccag agaaaccaccgcctcatcagtgaaattcttagtgagagtgtggtgccagacgttcggtca gttgtcacaacagctagaatgcaggtcctcaaacggcaggtccagtccttaatggttcat cagcgaaaactagaagctgaacttcttcaaatagaggaacgacaccaggagaagaagagg aaattcctggaaagcacagattcatttaacaatgaacttaaaaggttgtgcggtctgaaa gtagaagtggatatggagaaaattgcagctgagattgcacaggcagaggaacaggcccgc aaaaggcaggaggaaagggagaaggaggccgcagagcaagctgagcgcagtcagagcagc atcgttcctgaggaagaacaagcagctaacaaaggcgaggagaagaaagacgacgagaac attccgatggagacagcacagctgtatactgtcagcataactagagtaaaatttcatctt ggaactgacagtgacaaatgtgtgattacagaggagacacaccttgaagaaacaacagag agccaacagaatggtgaagaaggcacgtctactcctgaggacaaggagagtgggcaggag ggggtcgacagtatggcagaggaaggaaccagtgatagtaacactggctcggagagcaac agtgcaacagtggaggagccaccaacagatcccataccagaagatgagaaaaaagaataa