GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:51:50

Sequence gi568815596r:108796910_109031014 : 234105 bp : 43.82% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   6056   4378 1679  2  2   39   53   490 0.491  31.72
 1.00 Prom -   8382   8343   40                              -1.76

 2.00 Prom +  10122  10161   40                              -3.66
 2.01 Init +  13490  13495    6  1  0   39   69    11 0.682  -5.46
 2.02 Intr +  15722  15799   78  1  0   54  111    41 0.835   2.75
 2.03 Intr +  15911  16018  108  1  0   60  115    80 0.959   8.38
 2.04 Intr +  19032  19196  165  2  0   52   76    76 0.821   2.96
 2.05 Term +  19612  19665   54  0  0  116   51     6 0.577  -2.74
 2.06 PlyA +  21511  21516    6                               1.05

 3.03 PlyA -  21833  21828    6                               1.05
 3.02 Term -  39438  39375   64  2  1   48   44    85 0.602  -2.54
 3.01 Init -  43021  42963   59  1  2   80   97    50 0.861   5.88
 3.00 Prom -  86028  85989   40                              -2.96

 4.14 PlyA -  88634  88629    6                               1.05
 4.13 Term -  95075  94639  437  0  2   78   49    97 0.797   0.25
 4.12 Intr -  98804  98755   50  1  2   83   91    37 0.778   1.82
 4.11 Intr - 100320 100065  256  1  1   74   -9   402 0.590  25.60
 4.10 Intr - 109459 109399   61  1  1   71  121    96 0.786   9.51
 4.09 Intr - 111110 110951  160  1  1  130   60   256 0.538  26.99
 4.08 Intr - 113623 113551   73  2  1   98   87   127 0.892  12.06
 4.07 Intr - 113941 113867   75  2  0  112   62   115 0.992  10.79
 4.06 Intr - 114163 114038  126  2  0  113   66   269 0.992  27.85
 4.05 Intr - 115855 115769   87  2  0  109  100    32 0.871   6.44
 4.04 Intr - 126544 126459   86  0  2  124   82    91 0.976  11.56
 4.03 Intr - 132470 132289  182  2  2   85   94   292 0.993  28.17
 4.02 Intr - 133333 133211  123  1  0   73   95    85 0.894   8.48
 4.01 Init - 134105 134055   51  2  0   78  103    67 0.990   6.94
 4.00 Prom - 136403 136364   40                              -5.96

 5.03 PlyA - 137664 137659    6                               1.05
 5.02 Term - 139278 138981  298  2  1   73   47   186 0.753   7.74
 5.01 Init - 139446 139403   44  0  2   69  115    87 0.833   7.40
 5.00 Prom - 142250 142211   40                              -7.86

 6.08 PlyA - 143214 143209    6                               1.05
 6.07 Term - 143417 143298  120  2  0  107   38    94 0.693   4.77
 6.06 Intr - 145869 145725  145  2  1    4   43   107 0.253  -1.92
 6.05 Intr - 146902 146703  200  1  2   47   81   109 0.613   4.25
 6.04 Intr - 149272 149201   72  1  0   59   70    87 0.608   3.60
 6.03 Intr - 150864 150300  565  2  1   98   39    94 0.008  -1.70
 6.02 Intr - 164666 164568   99  1  0   90   42    55 0.001   0.33
 6.01 Init - 188621 188428  194  2  2   86   70   203 0.676  16.74
 6.00 Prom - 189060 189021   40                              -5.76

 7.08 PlyA - 189844 189839    6                               1.05
 7.07 Term - 195356 195279   78  2  0   50   49    98 0.120  -0.14
 7.06 Intr - 204248 204133  116  0  2   83   60    57 0.427   2.57
 7.05 Intr - 209044 208991   54  2  0  137   52    53 0.214   5.55
 7.04 Intr - 215551 215437  115  1  1   85   58    48 0.003   1.52
 7.03 Intr - 222049 221990   60  1  0   93   77    43 0.849   2.63
 7.02 Intr - 225008 224855  154  1  1   89  109    56 0.674   7.87
 7.01 Intr - 227581 227478  104  1  2   66   51   100 0.803   3.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_1|560_aa
MQGWFNIRKSINVIQHINRAKDKNHVIISTDAEKAFDKIQQPFMLKTLNKLGIDGTYFKI
IRAIYDKPTADIILNGQKLEAFPLKTGTRQGCLLSPLLFNIVLEVLARAIRQEKEIKGIQ
LGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQT
ESHIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGR
INIVKMAILPKVIYRFNAIPIKLPMTFFTELEKPTLKFIWNQKRARIAKSILSQKNKAGA
ITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHTYNYLIFDKPEKNKQWGKD
SLFNKWCWKNWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIG
VGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFTTYSSDKGL
ISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTT
MRYHLTPVRMAIIKKSGNNS

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_1|1680_bp
atgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacagagcc
aaagacaaaaaccacgtgattatctcaacagatgcagaaaaagcctttgacaaaattcaa
caacccttcatgctgaaaactctcaataaattaggtattgatgggacgtatttcaaaata
ataagagctatctatgacaaacccacagccgatatcatactgaatgggcaaaaactggaa
gcattccctttgaaaactggcacaagacagggatgccttctctcaccactcctattcaac
atagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaaaggtattcag
ttaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatctagaa
aaccctatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca
ggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaaca
gagagccacatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta
ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc
aaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga
atcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatcccc
atcaagctaccaatgactttcttcacagaattggaaaaacctactttaaagttcatatgg
aaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggagcc
atcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac
tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccg
catacctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat
tccctatttaataaatggtgctggaaaaactggctagccatatgtagaaagctgaaactg
gatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgtt
agacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacataggc
gtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacaacatgggagaaaattttcacaacctactcatctgacaaagggcta
atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
aaaaagtgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaa
aaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccact
atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagn

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_2|136_aa
MILNEASEENRKIDIQAKRVQARLDNLQRKYEFMTIQRLKGSSHAVHEMKSLKQEKAPVS
KTYKVPLNGQVYELLTVFMDWISDHHLSKVKHEESGMDGKKPQLKFASQRNDIQEKCVKS
PEVMQGITWLGAECAS

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_2|411_bp
atgatcttaaatgaagcaagtgaagaaaacaggaagatagacattcaggctaaaagagtt
caagctcgtttagataatttacagaggaagtacgagtttatgacaatacagagattgaaa
ggaagttcccatgctgttcatgaaatgaaaagtttaaaacaagaaaaagcaccagtttca
aaaacttacaaggtaccacttaatgggcaagtttatgaacttttaactgtcttcatggac
tggatttcggatcatcatcttagcaaagtgaaacatgaagaatctggaatggatggtaaa
aaaccacaactcaaatttgcttcccagagaaatgatattcaggagaagtgtgtaaagagt
ccagaggtgatgcagggtatcacatggctgggggctgagtgtgctagctga

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_3|40_aa
MTTLTIIFDILLETLANAIRKEGNPVTCYNMDEPYGYYAK

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_3|123_bp
atgactactctcaccatcatattcgacatactgctggaaactctagctaatgcaataaga
aaggaaggaaatcctgtcacatgctacaacatggacgaaccttatggatattatgctaag
tga

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_4|588_aa
MAHVGDCTQTPWLPVLVVSLMCSARAEYSNCGENEYYNQTTGLCQECPPCGPGEEPYLSC
GYGTKDEDYGCVPCPAEKFSKGGYQICRRHKDCEGFFRATVLTPGDMENDAECGPCLPGY
YMLENRPRNIYGMVCYSCLLAPPNTKECVGATSGASANFPGTSGSSTLSPFQHAHKELSG
QGHLATALIIAMSTIFIMAIAIVLIIMFYILKTKPSAPACCTSHPGKSVEAQVSKDEEKK
EAPDNVVMFSEKDEFEKLTATPAKPTKSENDASSENEQLLSRSVDSDEEPAPDKQGSPEL
CLLSLVHLAREKSATSNKSAGIQSRRKKILDVYANVCGVVEGLSPTELPFDCLEKTSRML
SSTYNSEKAVVKTWRHLAESFGLKRDEIGGMTDGMQLFDRISTAGYSIPELLTKLVQIER
LDAVESFFSSANEMLVSDSIIGADRAHILGSWELCNSIHHLGYLSIFLGPMVGPKLPATI
TSPGTSLQVSPEGQEVVLPSPSQQPPTSMHTTWDPENRPATATAIAHAMPAAQRPKNPPT
HPIHCCHYQQLNKPPGGPEISLPNNCQQRCQLYSPGHKQAHLAHCCHH

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_4|1767_bp
atggcccatgtgggggactgcacgcagacgccctggctccccgtcctggtggtgtctctg
atgtgctcagcccgagcggaatactcaaactgcggtgagaacgagtactacaaccagact
acggggctgtgccaggagtgccccccgtgtgggccgggagaggagccctacctgtcctgt
ggctacggcaccaaagacgaggactacggctgcgtcccctgcccggcggagaagttttcc
aaaggaggctaccagatatgcaggcgtcacaaagactgtgagggcttcttccgggccacc
gtgctgacaccaggggacatggagaatgacgctgagtgtggcccttgcctccctggctac
tacatgctggagaacagaccgaggaacatctatggcatggtctgctactcctgcctcctg
gcaccccccaacaccaaggaatgtgtgggagccacttcaggagcttctgccaacttccct
ggcacctcgggcagcagcaccctgtctcccttccagcacgcccacaaagaactctcaggc
caaggacacctggccactgccctgatcattgcaatgtccaccatcttcatcatggccatc
gccatcgtcctcatcatcatgttctacatcctgaagacaaagccctctgccccagcctgt
tgcaccagccacccggggaagagcgtggaggcccaagtgagcaaggacgaggagaagaaa
gaggccccagacaacgtggtgatgttctccgagaaggatgaatttgagaagctgacagca
actccagcaaagcccaccaagagcgagaacgatgcctcatccgagaatgagcagctgctg
agccggagcgtcgacagtgatgaggagcccgcccctgacaagcagggctccccggagctg
tgcctgctgtcgctggttcacctggccagggagaagtctgccaccagcaacaagtcagcc
gggattcaaagccggaggaaaaagatcctcgatgtgtatgccaacgtgtgtggagtcgtg
gaaggtcttagccccacggagctgccatttgattgcctcgagaagactagccgaatgctc
agctccacgtacaactctgagaaggctgttgtgaaaacgtggcgccacctcgccgagagc
ttcggcctgaagagggatgagattgggggcatgacagacggcatgcaactctttgaccgc
atcagcacggcaggctacagcatccctgagctactcacaaaactggtgcagattgagcgg
ctggatgctgtggagtcctttttctcgtctgccaatgagatgttagttagtgattctata
attggggcagacagggcacatattctggggtcctgggaactgtgcaattcaatccaccac
cttgggtacctgagcatctttctggggcctatggttgggcctaaactcccagccaccatc
acatcacctggtacctccttgcaagtgtcacctgaaggccaagaggttgtcttacccagt
ccatcacagcaaccaccaacatcaatgcacaccacttgggacccagagaatcgtcctgct
actgctactgccattgctcatgctatgccagctgcccaaaggcccaaaaacccacctaca
cacccaatccattgctgccactaccagcaactgaacaagccacctggaggcccagaaatc
agcctgccaaataactgccaacagagatgccagctatacagccctgggcacaaacaggca
cacttagcccattgctgtcatcactag

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_5|113_aa
MRSTQAWQALLGQARYEVWPFQGEAFGRGSTVTVKGGPTGWKNGKSRRLDHLPAEAVTLL
LLKVVTSREDGHLLPAVGTVCALEAAVMTSIDSQSHTTKGEGHSKRRAAQRGD

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_5|342_bp
atgcggtcaacccaggcgtggcaagccctgctgggccaggccagatatgaggtgtggccc
tttcagggagaagcttttggaagaggtagcaccgtcacagtcaaggggggcccaactggg
tggaagaatgggaagtcccggaggctggaccatctgccagccgaggcagtgactctgctt
ctcctaaaggtggtgaccagtagggaggacggccaccttcttcctgcagtgggcactgtc
tgtgctctggaggcagccgtgatgacaagtatagacagtcaaagccacaccaccaagggt
gaaggccattccaagaggcgagcagctcagagaggagactga

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_6|464_aa
MPLFSISEKTEDYGKQVGRKEPYRGRCTCGGSPACTTLRAHASLTHFTVNQTYCSKHSRK
ARLHLTSITKRAVQPSACKSDVGSGNQELYLPLKRAWLPGGLGGKNGFMGPVQPQDLASC
IPATPAPAAAKRGQGTAQAIASEGASPKPWQLPLGVGTAGVQKTRIELWESPPRFQSMYE
NAWMSRQKSTAGVKPSWRTSTKAMQLEPPHRVPTGALPSGAVRRAPPFSRPENGRSTDSL
HHALGKSTGTQHQPMKAVTGAVPCRATEAELSKAMGAHPLHQHALDLLTAAGTHSEHISQ
GVDDCQEEGLPEVCCGATQSILEKGGVLSPMDSSPPNWRADTSHQERAKKGWLLLLNTPG
HGKYAESSLWTFCCARKARTFVRVQMARVVPGSEAGFVVKGEAGTGTRWNRRAPAVLKGL
WRGKQTIRAILGGSASTTHVGTFCITYNSQPERSSKPSAGPRDT

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_6|1395_bp
atgcctttgttttccatttccgagaaaacagaggactatggaaaacaggtgggccgtaag
gagccctacagaggaagatgcacctgtggcggtagccctgcctgcaccaccctgcgagct
catgccagcttaactcacttcacggtcaatcagacctactgctccaaacacagtaggaag
gcgcgccttcacttaacgtccataaccaaaagagcagttcaaccttcagcatgtaaatct
gatgtaggcagcgggaaccaggagctctacctccctctgaagagggcctggctacccgga
ggcctgggagggaaaaatggtttcatgggccctgtgcagccccaggacttggcatcctgc
atcccagccactccagctccagctgcagctaaaaggggccaaggtacagctcaggccatt
gcttcagagggtgcaagtcccaagccttggcaacttccacttggtgttgggactgcgggg
gtgcagaagacaagaattgagctttgggagtctccacctagatttcagagtatgtatgaa
aatgcctggatgtccaggcagaagtctactgcaggggtgaagccctcatggagaacctct
accaaggcaatgcagttggagcccccacacagagtccccactggggcactgcctagtgga
gctgtgagacgagcaccaccattttccagacccgagaatggtagatccaccgacagcttg
caccatgcacttggaaaatccacaggcactcaacaccagcccatgaaagcagttacagga
gctgtaccctgcagagccacagaggcagagctgtccaaagccatgggagcccaccccttg
catcagcatgccctggatctcctcacagctgctgggacacactccgagcacatctcacag
ggtgtggacgactgccaggaggaggggctgcctgaggtgtgctgcggagcaacacagagc
atcctggagaaaggaggggtgctcagccccatggattcatctcccccaaactggagagca
gacaccagccaccaagaaagagccaagaaaggatggctgctgttactcaacacaccaggc
cacggaaaatacgcggagtcttcgctctggacattctgttgtgctcggaaggcacgaacc
ttcgtgagagtccagatggccagggtggtgcctgggagcgaagctggcttcgtggtaaaa
ggagaggcgggaacaggaacgcgctggaaccgcagggctcccgccgtcttgaaaggcctc
tggcgtggaaagcagaccatccgagccatcctgggaggctcagcctccaccacccatgtg
ggcaccttctgcattacctacaatagccagcctgagaggagcagcaagccctctgcaggc
cctcgggacacgtaa

>gi568815596r:108796910_109031014|GENSCAN_predicted_peptide_7|226_aa
NFCKGGFNPNIFCSGIYLQSFEIFDSSREKIQRDWALLNPWWKVKNRASQILFSGKTILK
AILFGFLNRYLSASAVFFLRTSGDAQMFTEGMLRVTTVLVLGTRSVGWGEKERGRRFKVD
SVGALKTEGCQVRLKTEPPLGIRGAEHMGSSTTPKPWELDKGAQLIPLGFSYTSHSCSNE
TLSTLADKEKVIRTKQETYNGQLEQYRHVEDVTCICCMNKHMNMMA

>gi568815596r:108796910_109031014|GENSCAN_predicted_CDS_7|681_bp
aatttttgcaaaggtggtttcaaccccaacatcttctgttcaggaatctacctgcaaagt
tttgagattttcgacagtagccgtgaaaagattcaaagagactgggcacttctcaatccc
tggtggaaggtgaaaaatcgtgcctcacagattctcttctcaggaaagacaatcctgaaa
gcgattttgttcggatttcttaaccgctatttgtctgcttctgctgtcttctttttaagg
acctcgggggatgcccagatgtttactgagggcatgctacgtgtcaccactgtcctggtg
ctagggacacgatcagtgggctggggggaaaaggagcgtgggaggagattcaaggtagat
tctgtgggggctctcaagaccgaaggatgccaggtacgcctgaagacagagccaccgctg
ggaatcaggggcgctgaacacatgggcagctccacgactccgaagccctgggaattggat
aaaggcgcacagttgattcctttaggattttcttatacgagccactcatgtagcaatgaa
actctgagcacccttgctgataaggagaaagttatcagaaccaaacaggaaacctacaat
gggcaactggaacagtaccggcacgtagaagacgtgacgtgcatttgctgcatgaataaa
cacatgaacatgatggcatag